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FUNCTIONS OF PARTING CEREMONIES 
IN DYING ORGANIZATIONS 


STANLEY G. HARRIS 
University of Michigan 


ROBERT I. SUTTON 
Stanford University 


This paper advances a theory about the functions that parting ceremonies 
serve for members displaced by organizational deaths. The theory is 
grounded in data from 11 parties, picnics, and dinners that occurred 
in six dying organizations. We propose that parting ceremonies create 
settings in which people can exchange emotional support and can edit 
self-, social system, and event schemata. We also assert that such gather- 
ings provide opportunities for managers to influence the course of or- 
ganizational demise. 


“The wake is to be an occasion to remember the vigor and charm of the departing 
spirit. We are interested in recalling and sharing memories of the place with those 
who were its friends and who may have benefited from their association with it over 
the years. If you cannot attend, please consider sending a message perhaps containing 
an anecdote you remember with pleasure." 

—An invitation to a “wake” for a dying organization. 


This article describes the attributes and functions of a set of ceremonies 
that we observed in an investigation of dying organizations.! We were struck 
by the prevalence of parties, picnics, and other social occasions during the 
final phases of organizational death. We labeled these events parting 
ceremonies. 

Ceremonies are “relatively elaborate, dramatic, planned sets of activities 
that consolidate various forms of cultural expressions into one event, which 
is carried out through social interactions” (Trice & Beyer, 1984: 655). Turner 
(1977) asserted that such ritualistic behaviors arise when the structure of 


An earlier version of this manuscript was presented at the 1984 meeting of the Academy of 
Management as part of the “Sense-making in Organizations" symposium co-sponsored by the 
Organization and Management Theory and the Organizational Behavior divisions, We wish to 
thank Daniel Denison, Nancy Harris, Robert Kahn, and Karl Weick for their helpful comments 
on this manuscript and Reginald Bruce and Susan Schurman for their help in collecting these 
data. We would also like to thank the people who took the time to talk with us about their 
experiences with organizational death. 

1 The use of the death metaphor is intentional] and serves to remind us that organizations are 
living and dying systems (Miller, 1978) and that there is a strong similarity between the losses 
people experience at the death of an organization and of a friend. The analysis we present in the 
following pages has benefited from the mutation of the death metaphor (Weick, 1979). See 
Sutton (1984) for further discussion of this metaphor. 
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social relations is threatened. Organizational death causes members and 
other participants to lose a shared social arena and thus poses a profound 
threat to the structure of social relations. As a result, an array of rituals and 
ceremonies characterize dying organizations, including collective bargaining, 
disposing of property, making public announcements, and severing ties with 
clients and suppliers. We seek here to propose a theory about one of these 
activities—the parting ceremony. 

Parting ceremonies are formal social events that arise in response to the 
imminent breaking of social bonds. Like other rites of passage (Van Gennep, 
1960), parting ceremonies help separate members from dying organizations 
and to a lesser extent facilitate their transition and integration into new 
roles. The stated purpose of such gatherings is usually to offer a final oppor- 
tunity for everyone affected to meet as a group. Examples of parting ceremo- 
nies outside the domain of organizational death include high school gradua- 
tion exercises, retirement parties, funerals, wakes, and going-away parties. 

Parting during organizational death can be accomplished through a variety 
of socially approved activities. Such parting activities vary on at least two 
dimensions—-formality and number of participants. For example, partings 
that occur between two friends are usually informal. In contrast, the parting 
ceremonies described here are relatively formal because someone plans and 
sponsors them, and they include more people. 

Parting ceremonies appear to be prevalent in organizational death; they 
were a dominant theme in six of the eight closings we studied. Nonetheless, 
the modest literature on ceremonies in organizations (e.g., Deal & Kennedy, 
1982; Trice, 1985; Trice, Belasco, & Alutto, 1969; Trice & Beyer, 1984) does 
not address parting ceremonies. Trice and Beyer observed that no rites or 
ceremonials were “found at the organizational level that have the manifest 
intent of dissolving existing social arrangements without degrading their 
occupants” (1984: 667). This study offers evidence of the existence of such 
ceremonies. 

We sought to develop a theory about parting ceremonies because they 
appear to serve important functions during organizational death. Moreover, 
we found parting ceremonies to be among the most interesting aspects of 
organizational death. As Davis (1971) made clear, interest may be the most 
important quality that a theory can possess. 

Studying such parting ceremonies also enhances knowledge about the 
process of organizational death. Death appears to be a common occurrence 
in all populations of organizations. Starbuck and Nystrom (1981), for example, 
presented evidence that 45 percent of American corporations die within 5 
years of birth, and 90 percent die within 20 years. They also reported that 
federal agencies die at a similar rate. 

Organizational theorists offer an array of causal explanations for this 
common transformation. The literatures on organizational effectiveness 
(Cameron & Whetten, 1982; Goodman, Pennings, & Associates, 1977), open 
systems (Katz & Kahn, 1978), and population ecology (Hannan & Freeman, 
1984; McKelvey, 1982), for example, predict when and explain why the 
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event of organizational death occurs. In contrast, little has been written 
about the process of organizational death. Although some literature has 
explored conceptual issues that arise when industrial plants close (Hardy, 
1982; Taber, Walsh, & Cooke, 1979), most existing descriptions of organiza- 
tional death are journalistic rather than theoretical (e.g., Nance, 1984; Slote, 
1969). Parting ceremonies provide a useful vantage point from which to 
study this ubiquitous social process. 

The theory proposed here emphasizes the positive consequences of part- 
ing ceremonies; thus it is in the functional tradition (Trice, 1985). We view 
organizations as living and dying systems, which is also consistent with the 
organismic metaphor characteristic of the functional approach (Miller, 1985; 
Morgan, Frost, & Pondy, 1983). The functional perspective suggests that 
patterned behavior contributes to the maintenance and the survival of social 
systems (Trice, 1985). Specifically, the present theory-building effort consid- 
ers several functions that parting ceremonies serve. These include maintain- 
ing and promoting (1) individual well-being, (2) group cohesion, (3) task per- 
formance, and (4) positive relations with external stakeholders. 

We sought data about both the functions and dysfunctions of parting 
ceremonies, but the theory proposed here emphasizes the benefits rather 
than the hazards of such ceremonies. We did not incorporate the dysfunc- 
tions of parting ceremonies into the theory because our data and other descrip- 
tions of parting ceremonies (Slote, 1969) suggest that such social gatherings 
have primarily functional consequences. 

We acknowledge that we encountered some evidence of dysfunctions. 
Moreover, other behaviors displayed during parting ceremonies appear to 
serve no function at all. The neglect of dysfunctions does not invalidate the 
theory offered here, because the presence of functional consequences is not 
logically incompatible with the presence of dysfunctional consequences. Yet 
such neglect does render the theory less than complete. As Weick (1979) 
pointed out, all organizational theories suffer from this malady because of 
the complexity of social phenomena. 

In their codification of rites, Trice and Beyer (1984) distinguished between 
those with manifest consequences for individuals and those with manifest 
consequences for groups or organizations. Similarly, we separately discuss 
functions that parting ceremonies serve for (1) individuals, (2) organizations, 
and (3) external stakeholders. However, the theory offered here emphasizes 
the functions that parting ceremonies serve for displaced organizational 
members. 

Our research was inductive. Following the logic of induction, we describe 
the research first and propose our theory second. 


METHODS 
Organizations 


This research was conducted in southeastern Michigan from the sum- 
mer of 1982 through late spring of 1983. Michigan was experiencing a deep 
economic recession during this period and thus served as a natural laboratory 
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for studying organizational death. The investigation of parting ceremonies 
described here was part of a larger study about the process of organizational 
death (Sutton, 1984). 

The sampling strategy for this study followed the example of Lawrence 
and Lorsch’s (1967) classic research on environment and organization 
(Lawrence, 1981). We deliberately, but not randomly, selected a diverse - 
sample of organizational deaths. Such variation was needed to help generate 
theory about both constant elements that arise in all or almost all organiza- 
tional deaths and variable elements that explain differences among organiza- 
tional death processes (Sutton, 1984). 

Our efforts to develop theory about parting ceremonies focused on only 
constant elements. Nonetheless, the varied sample served to strengthen 
theory-building about constant elements that we proposed are present in all, 
or nearly all, parting ceremonies. Similarities observed across a diverse sam- 
ple offer firmer grounding for such propositions than constant elements 
observed in a homogeneous sample of organizations. We sought a sample of 
organizational deaths that would fill four theoretically derived categories 
(see Table 1). 

The final sample included the following: in category 1, private sector 
organizations with no parent organizations, a small automobile company 





TABLE 1 
Core Cases of Organizational Death 
Age 
Organization - Size? (Years) Informants^ 
Private sector 

Independent Small automobile company 30 Di 2 

1 
Sporting goods store 4 1 1 

3 
Dependent on a parent Hospital 180 46 8 

organization 135 
Retail store H 9 4 

? 
Large auto plant 550 25 8 

10 

Public sector 

Freestanding facility Center for the disabled 374 20 8 

13 
Part of a larger facility Academic unit 14> 58 8 

13 
Research organization 30 16 7 

18 





* The top number indicates how many employees worked for the defunct organization one 
year prior to its closing; the bottom figure indicates the number one month before the closing. 
These data are missing for the large department store. 

5 These numbers refer to faculty members only; support staff and graduate students are ex- 
cluded. SR 

© Note that while the total here is 46 informants, this actually represents 44 individuals. Two 
individuals, members of a shared parent organization, acted as decision makers for two distinct 
closings. 
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and a small sporting goods store; in category 2, private sector organizations 
dependent on a parent organization, a hospital, a large retail store, and a 
large auto plant; in category 3, public sector, freestanding facilities, a center 
for the disabled; and in category 4, public sector organizations embedded in 
larger facilities, an academic unit and a research organization. 

In addition to variation in technology, sector, and dependence, Table 1 
reveals much variation in organizational age (1 to 58 years) and number of 
members (4 to 550) within this small sample. 

Three additional criteria guided the selection of cases. First, organiza- 
tions had to be in the process of closing or have been closed no more than 
one year before data gathering began. Second, we had to be allowed to study 
the death. Management had to grant permission if an organization were still 
alive. For organizations that were defunct, we contacted displaced members. 
Cases were included if enough key individuals were willing to discuss a 
closing. We contacted members of 20 organizations before reaching our target, 
a sample of eight organizational deaths. 

Finally, informants had to agree that there was no chance that their 
organizations would start activities again in the future. Following living 
systems theory (Katz & Kahn, 1978; Miller, 1978), we operationally defined 
organizations as dead when their input-throughput-output cycles had ceased 
permanently. 


Informants 


People who made and implemented decisions about an organization’s 
closing were our primary informants. We selected and tried to interview the 
head of each organization and other key decision makers including human 
resource managers, marketing managers, and operations officers. We also 
sought to interview decision makers from parent organizations and union 
leaders when such individuals existed. Moreover, we interviewed infor- 
mants who were not decision makers when they had essential information. 

The perspectives of decision makers were emphasized because they usu- 
ally have high network centrality (Tichy, 1981: 229). Compared to other 
members, decision makers typically have structural connections to the great- 
est numbers of people in organizations, receive the most information, and 
send the most information. They are also most likely to have access to 
organizational records. Moreover, they are usually the last people to be 
displaced. Thus, decision makers are most likely to have observed all phases 
of organizational death, including parting ceremonies. 

The number of informants interviewed for each closing (see Table 1) 
followed the notion of “theoretical saturation" suggested by Glaser and Strauss 
(1967: 61—62); we stopped interviewing additional people when we began 
hearing the same stories repeated over and over. We invited 55 people to 
participate in the study. A final sample of 44 agreed to participate. Of these 
44 informants, 26 were members of the dying organizations studied and 18 
were nonmembers. Nonmembers included decision makers from parent 
organizations and union leaders. 
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Data Sources 


The case descriptions of the eight organizational deaths are based primar- 
ily on responses to semistructured interviews and on archival data. Each 
interview began with questions about the norms and history of an informant’s 
organization. The next questions concerned the early stages of the organi- 
zation’s death including rumors, the struggle to save the organization, the 
decision to close it, and the announcement of the closing. Next were ques- 
tions about the events during the process of disbanding the organization. 

The remainder of the interview concerned the final days and hours of 
the organizational closing, and the psychological and economic effects of the 
closing on various stakeholders. This section included questions about the 
existence and occurrence of parting ceremonies. Moreover, since this research 
was inductive, we encouraged interviewers to ask questions about lines of 
discussion that deviated from the prepared questions. 

Most interviews (40 of 44) were tape-recorded and transcribed. The length 
of these contrived conversations varied from 45 minutes to over four hours; a 
typical interview was about 90 minutes. Interviews usually took only one 
session, but occasionally researchers or informants requested more sessions. 
For example, we conducted an additional interview with the head of the 
center for the disabled to learn more about a trio of parting ceremonies. 

Access to records produced during the organizational deaths was consid- 
ered essential. We asked each informant if any records existed about a closing, 
where they were kept, and if we could have permission to copy the 
documents. We gathered useful data from such records as letters, lay-off 
lists, budgets, and newspaper stories for all the organizations except the 
small sporting goods store. For instance, the quote that introduces this paper 
came from an invitation to a “wake” for the academic department. 


Analysis Strategy 


The method used to develop theory about parting ceremonies drew on 
existing descriptions of how to generate theory from qualitative data, particu- 
larly writings by Glaser and Strauss (1967), Mintzberg (1979), and Miles 
(1979). In general, our method entailed continually comparing theory and 
data until we had developed adequate conceptual categories. 

Specifically, efforts to develop a theory about parting ceremonies went 
on within the larger study of which this research was a part. Before and 
during data collection, we met every week to discuss the emerging theory. 
Our earliest meetings focused on constructing a rough working framework. 
We heeded Miles’s (1979) warning that “the need to develop grounded theory 
exists in tension with the need for clarity and focus; research projects that 
pretend to come to the study with no assumptions usually encounter much 
difficulty” (1979: 119). 

Subsequent meetings entailed comparing the emerging framework to 
new evidence from the field study and the organizational studies literature; 
adjusting the framework to allow for new facts and new ideas; and planning 
to collect additional evidence. One week, for example, we noted that former 
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members of dying organizations usually receive invitations to parting 
ceremonies. We developed hypotheses about the functions such invitations 
serve. Plans were made to call back an informant and ask him if former 
members of his defunct organization had been invited to parting ceremonies. 
Finally, we decided to urge the other two members of the research team to 
ask informants about such invitations. 

This iterative approach is similar to the constant comparison method 
suggested by Glaser and Strauss (1967). We used this method to generate a 
theory about the functions that parting ceremonies serve for displaced mem- 
bers and to identify functions that such ceremonies serve for the manage- 
ment of dying organizations and their parent organizations. 


THE FUNCTIONS PARTING CEREMONIES SERVE 
FOR DISPLACED MEMBERS 


The members of dying organizations share a variety of fates. Organiza- 
tional demise—in contrast to workforce shrinkage—displaces all members. All 
members must cope first with the threatened loss of their organizations and 
roles, then with the actual loss and displacement that follows, and finally 
with the transition to new roles. Unemployment is one outcome of organiza- 
tional death; less widely recognized individual transitions include retirement, 
transfer, changing to a job in another organization, and going back to school. 

The death of an organization causes the loss of an important network of 
mutual obligations. It destroys a major social arena in which members have 
spent much of their time. Because of this loss, a closing is emotionally 
charged; it causes mourning, anger, depression, sorrow, and fear of the 
unknown, the future, and the ambiguous present. 

The events leading to the death of an organization and the demise itself 
also place strong demands on members to make sense of an unfamiliar stream 
of events. In the process of disbanding an organization the cognitive struc- 
tures and strategies individuals used to make sense of their organization and 
their roles within it become obsolete. ] 

The threatened and actual loss of shared organizational arenas and subse- . 
quent role transitions require members to modify how they feel and think 
about themselves, their jobs, and their co-workers. There are many ways in 
which displaced members can respond to such affective and cognitive 
demands. They can seek individual counseling. They can rely on family and 
friends for emotional and financial support. Or they can withdraw. Most 
displaced members use some combination of these coping strategies, each of 
which is useful for partially meeting the demands placed upon them. 

We propose that parting ceremonies offer displaced members additional 
help in coping with the affective and cognitive demands of organizational 
death. Parting ceremonies are unique coping mechanisms because many, 
sometimes all, displaced members can share the antidote at the same time 
and in the same place. Moreover, the members are at once providers and 
receivers of support and coping strategies. 
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Attributes of Parting Ceremonies 


Formal parting ceremonies occurred in six of the eight dying organiza- 
tions we studied. There was much variation in parting ceremonies across 
organizational deaths (see Table 2). The names varied; examples include the 
wake, the last supper, the last hurrah, and the final party. Employees paid for 
some events, unions for others, and management for still others. Further, as 
evidence from the center for the disabled suggests, parting ceremonies held 
for the same organizational death varied. 

The ceremonies emerged in various ways. In the case of the center for 
the disabled, a huge picnic on the last day was suggested by a group of 
employees, but sponsored by management. In contrast, the local union spon- 
sored the two major parties in the large auto plant closing. The first was with 
the permission of the national union, but the second party arose as an act of 
defiance. The national union had ordered the head of the local union to turn 
the money in the local treasury over to them. The local union leader said “‘to 
hell with that" and organized a bash for several hundred people. 

Finally, some parting ceremonies arose despite management's resistance. 
The acting president of the hospital was puzzled by the emergence of a party: 
I didn't expect any parties. In fact, they asked me if the hospital 
would pay for and sponsor a party for the employees. . .. I said I 
didn't think it was appropriate to have the hospital put on a 
party like we would for Hospital Week or something else to 
celebrate. It just didn't seem like we ought to be celebrating this 
to me...so they went ahead and generated their own parties. 

Despite such variation, we were intrigued by the following 11 constant 
elements that occurred across these parties. An example is provided for each 
element. 

1. Participants exchange names and addresses. 

A list of former and current employees’ names, addresses, and 
phone numbers was compiled prior to the picnic at the center for 
the disabled and then distributed at the picnic. 

2. Participants promise to keep in touch. 

At the hospital, a nurse reported that plans were set for a yearly 
reunion of all former employees. 

3. Participants consume food and alcoholic beverages. 

The “last hurrah” for the large auto plant was characterized by 
eating, drinking, and dancing “all day and all night," in the 
words of one informant. 

4. Participants express sadness. 

“It was sad," a member of the academic unit said; “somber, sad I 
guess," said a former auto plant worker. 

5. Participants express anger. 

The head nurse at the hospital reported: "There was a fair amount 
of animosity . . . more than I expected." 
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6. Former members who have left are invited and attend.” 
An informant who attended the wake held for the academic unit 
reported: "Well it was nice for one reason. It was like a reunion 
because there were a lot of people who had been members of the 
organization who had come whom we hadn’t seen for some years. 
So it was nice and felt like a high school reunion.” 

7. Participants discuss their own future and that of others. 
The head of the center for the disabled made the following obser- 
vation about one of their parting ceremonies: "There was a lot of 
talk about what are you going to do next. A lot of people were 
asking me what I was going to do, for instance. I had headed the 
facility. They had some psychological investment, if you will, in 
my future. I guess they felt if I could find a job maybe they would 
too or something.” 

8. Participants tell one another “I guess this means it’s really over.” 
One informant remarked, “I’ve never seen a guy come back from 
the grave after his funeral.” 

9. Participants discuss the causes of the organization’s demise. 
While one parting ceremony was being planned, members of the 
soon-to-be defunct organization designed and produced a t-shirt 
with an inscription that blamed the parent organization for the 
death. This t-shirt was the subject of much discussion at the 
ceremony. 

10. Participants tell stories about the "old place." 
The members of the academic unit told stories about the days 
when the unit was rated as the best in the country, and people 
who could not attend the wake were asked to “please consider 
sending a message perhaps containing an anecdote you remem- 
ber with pleasure." 


11. Participants take photographs. 


One administrator reported: “We got a room down at the Elks 
Club and had a lunch, took a few pictures. There was one point 
where, I didn't get all 20, but I got 17 or 18. I set up my camera 
with the timer and tried to take a picture of the whole crew." 


We propose that tliese 11 constant elements do not arise by coincidence. 
They arise in response to the demands that organizational death places on 
members. As Moore and Myerhoff noted, “ collective ritual can be seen as an 
especially dramatic attempt to bring some particular part of life firmly and 
definitely into orderly control" (1977: 3). These common elements appear to 
serve functions that parallel the affective and cognitive demands faced by 
displaced members. They reflect settings that (1) provide emotional support, 
and (2) facilitate editing of schemata. Figure 1 summarizes the theoretical 
relationships among the affective and cognitive demands, the 11 constant 
elements that arise in parting ceremonies, and the two functions. As Figure 1 
reveals, some constant elements may serve both functions. For instance, 
former members can provide both emotional support and information for 
schema editing. Other elements appear to serve primarily one function; for 


?]t was unclear from our data whether this activity occurred in the large retail store's 
parting ceremony. 
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FIGURE 1 
The Functions of Parting Ceremonies in Dying Organizations: 
A Conceptual Framework 





Eleven Constant Elements 
Demands in Parting Ceremonies Functions 


1. Exchange names and addresses 
2. Promise to keep in touch 
3. Consume food and 
alcoholic beverages 
AFFECTIVE ——» 4. Express sadness EMOTIONAL 
5. Express anger SUPPORT 
6. Former members who have 
left are invited and attend 
7. Discuss own futures and 
those of others 
8. Tell one another “I guess this 
means it is really over" 


COGNITIVE ———* 9. Discuss cause of organization's 


dentise SCHEMA EDITING 


10. Tell stories about the old place 
11. Take photographs 


instance, taking snapshots facilitates only schema modifications. We explore 
the relationship between the constant elements and functions below. 


Emotional Support 


An administrator described the response of employees of the hospital to 
the closing announcement thus: 


“A lot of people cried. An awful lot of people cried, more than I 
expected actually.” 


The head nurse of the dying hospital said: 


“T made sure that I was visible to people. And if they wanted to 
scream at me, they could scream at me. I mean it hurts your 
feelings and everything. Even though you say you're not going to 
take it personally, you do. But I think people needed to vent." 


Organizational death packs an emotional wallop. Survey data from this 
research support the notion that the deaths of organizations are emotionally 
distressing. We asked informants who had been members of the eight defunct 
organizations to compare the closing experience with five increasingly threat- 
ening life experiences: (1) receiving a parking ticket, (2) an argument with a 
friend or relative, (3) a serious illness, (4) a divorce, and (5) the death of a 
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spouse.? The mean response (3.39 on a 5-point scale) indicated that the 23 
former members felt that the closing experience was somewhat worse than a 
serious illness. Responses of 9 of the 23 informants (3996) highlight the 
individual distress generated by these organizational transitions; they reported 
that the closing experience was equivalent to a divorce or the death of a 
spouse. 

In qualitative terms, informants spoke of “missing the old place" and 
how much they had “loved” their organization. The following excerpt from 
the Wall Street Journal further illustrates the sense of loss that can accom- 
pany organizational death: 


A psychologist asked Jerald H. Maxwell, former head of the 
defunct Medical General Inc., “If all your family was killed, how 
would you feel?” He replied, “Bad but losing my company hurts 
worse” (Ingrassia, 1982: 20). 


Because the emotional responses evoked by a closing are negative and 
uncomfortable, members are motivated to cope with them. Emotional sup- 
port can functionally counter these affective demands. Our concept of emo- 
tional support follows from House's (1981) concept of the same name and 
Kahn and Antonucci's (1980) concept of affective support. Emotional sup- 
port includes people's expressions and subsequent perceptions of esteem, 
trust, liking, respect, admiration, and love for each other. It also includes 
their simply listening to each other. 

Those who attend parting ceremonies exchange emotional support in 
both explicit and implicit ways. At least 7 of the 11 constant elements serve 
the function of emotional support, as Figure 1 shows. Exchanging addresses 
and promising to keep in touch suggest, albeit implicitly, that people like 
one another. 

These acts also help members break current social bonds in a way that is 
graceful and socially acceptable. They exchange names and addresses and 
promise to keep in touch because these symbolic acts enable them to leave 
their old roles without embarrassment. In Goffman's (1959) terminology, it 
enables them to save face. It is much easier to walk away from friends with a 
promise to keep in touch. In contrast, people rarely say, "TU never see you 
again," because that would make parting awkward and embarrassing. The 
administrator of the center for the disabled noted: 


it [the party] was very much like when people move away and 
they tell each other that they are really not saying goodbye. They 
promise to come back and visit but it doesn't happen. It is a way 
of making a transition. 


Furthermore, work roles entail more than just commitment to jobs; they 
also require commitment to social systems. People are reluctant to give up 


3 This question was part of a self-administered questionnaire used in the larger study of 
organizational death (Sutton, 1984). Except for using this "life events ladder" (Cobb & Kasl, 
1977: B13-15), the present analysis of parting ceremonies relies exclusively on qualitative data 
gleaned from informants and from records data. Only informants who had been members of the 
eight dying organizations were asked this question; of the 30 members, 23 returned the 
questionnaire. Sutton (1984) contains a complete description of instruments used. 
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both jobs and meaningful relationships. To ease the stress of displacement, 
individuals need to feel that loss of their organizational roles will not sever 
the overarching relationships between members. In their review of the litera- 
ture on adult socialization, Mortimer and Simmons (1978) found evidence 
that “persons with meaningful attachments in several life areas may be pro- 
tected from a sense of overwhelming loss when disengagement from any 
single role becomes necessary” (1978: 426). Huntington and Metcalf (1979) 
discussed how the continuity of life for those not dead is “focused upon and 
enhanced during the rituals surrounding a death” (1979: 93). Similarly, part- 
ing ceremonies provide settings in which participants can reaffirm their 
commitment to each other beyond the bounds imposed by an organization. 
This is also one of the reasons that participants discuss one anothers' futures. 

Sharing food and drink can serve as a symbol for social relationships 
(Gusfield & Michalowicz, 1984); thus, it is another way in which people 
facing adversity can reaffirm a sense of community, or “family,” in the 
words of some informants. Parting ceremonies in this research usually 
included meals, not just snacks. People usually view meals as special times 
that should be shared with family or friends, and hence, they symbolize 
closer relationships than mere membership in the same organization implies. 

Listening to each other express sadness and anger is one way in which 
members of defunct organizations exchange emotional support in these 
ceremonies. As Trice and Beyer (1984) pointed out, certain rites "permit 
expressions of emotion—including unacknowledged conflict—that are not 
allowed on other occasions. Letting off steam doesn't count in the same way 
as it would on other occasions" (1984: 663). Parting ceremonies also provide 
opportunities for others to respond to these sanctioned expressions with 
empathy and understanding. 

The presence of alcoholic beverages may serve to further sanction the 
expression of emotions that are unacceptable in other contexts. Perhaps 
individuals expressed some of the anger and sadness because of the physio- 
logical influence of alcohol on inhibitions. A number of authors, however, 
have argued that the symbolic effects of alcohol on expression are more 
powerful than the physiological effects (Cavan, 1966; McAndrews & Edgerton, 
1969, Van Maanen, 1984). Drinking can be viewed as a socially constructed 
ritual that permits certain behaviors at certain times, including the expres- 
sion of otherwise unacceptable emotions. In addition to loosening people's 
inhibitions, wine, beer, and liquor may serve as common symbols that “can 
minimize for a ceremonial moment their disconnections and conflicts in a 
crowd" (Moore & Myerhoff, 1977: 6). 

The loss of a job has been compared before to the loss of a loved one 
(Greenhalgh, 1979). Experiencing feelings similar to those evoked by the loss 
of a friend or relative, people feel both angry and depressed about losing 
common organizational arenas. Parting ceremonies provide socially accepted 
settings in which members can grieve together for the loss of their organiza- 
tions. Most informants noted the analogy between parting ceremonies and 
funerals. Names such as the last supper and the wake also reflect this analogy. 
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As does the coming together of people bound by a common friend who 
has died, a parting ceremony enables people to say goodbye to one another 
as one large body. The process of mourning exhibited in the parting ceremo- 
nies we studied may also function to help ‘‘detach the survivor's memories 
and hopes from the dead” (Freud, 1952: 65). This sharing of a common loss 
and the support conveyed during this process is accentuated by former 
members' attending and by members' expressing concern for one another 
when future roles are discussed. 

Finally, parting ceremonies may promote constructive social comparison. 
A primary reason that parting ceremonies may arise during organizational 
death is that people appear to seek the company of similar others when they 
are under stress. In a series of classic experiments, Schacter (1959) gave 
subjects expecting to receive electric shocks a choice between waiting alone 
or with others. People who anticipated receiving a painful electric shock 
were more likely to request the company of others than those who antici- 
pated receiving a mild shock. He interpreted these results as evidence that 
people facing high stress prefer the company of similar others so that they 
can evaluate and adjust their responses to the threat. Most people have little 
experience in coping with organizational death. By interacting with others 
who are also trying to cope with the demise, members can assess the appro- 
priateness of their emotional and expressive responses. 


Schema Editing 


It appears that parting ceremonies also provide settings that facilitate the 
editing of displaced members’ schemata. A schema is an internal cognitive 
structure for encoding and representing incoming information with some 
efficiency (Markus, 1977). Schemata are essential to humans because they 
serve as initial frames of reference for behaving and perceiving (Weick, 1979). 

According to Neisser’s (1976) perceptual cycle, individuals’ schemata 
actto direct perceptual explorations of their current environments. Individuals 
may construe the information samples garnered in such explorations as con- 
gruent or incongruent with their schemata. The perception of incongruence 
can lead to schema modifications that in turn lead to somewhat different 
perceptual explorations. 

Parting ceremonies create settings that facilitate such perceptual explora- 
tions. We assert that parting ceremonies act as settings in which displaced 
members can collectively compare, evaluate, and modify their existing 
schemata. We propose that at least six of the behavioral transactions that 
occur among those who attend parting ceremonies facilitate the editing of 
three interrelated schemata. The three are social system, self-, and event 
schemata. Although these schemata are not mutually exclusive, it is useful 
to describe them separately for analytical purposes. 

Social system schemata. A social system schema is a displaced member's 
cognitive map of the history, attributes, and fate of an organization. Parting 
ceremonies provide many opportunities for people to edit their social sys- 
tem schemata. 
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In all parting ceremonies we studied, people spent much time telling 
stories about their organizations and its members. This storytelling did more 
than make people smile. It also helped them to remember, compare mental 
histories, and edit and encode their own stories. Parting ceremonies can 
function as “summaries,” a term borrowed from Albert and Kessler: 

The summary is defined as an abbreviated statement of the his- 
tory of a social encounter. More specifically, it is a "selective 
repetition” of aspects derived from the “history” of the social 
encounter. This repetition is presented in a “condensed” and 
often "symbolic" form .... The summary helps the departing 
participants remember the encounter. A summary processes his- 
toric information in such a way that it allows the participants to 
symbolically take the encounter away with them as they leave 
(1976: 154—155). 
Albert and Kessler also noted that because summaries are selective, they 
make possible abbreviated histories of encounters. Further, the omission of 
certain events due to abbreviation allows for their devaluation. 

Slote (1969) provided an anecdote from a parting ceremony that conveys 
how telling stories can help people edit their mental histories: 

Everyone told his own story, the funny things, the angry things, 
stories about guys who'd left fifteen to twenty years ago, and 
what happened to old so-and-so who worked in S and R and 
used to pitch horseshoes behind the boiler room, incidents out 
of the past came tumbling out, fights, ball games. Perry DeMoss 
came in and had a drink and pretty soon he was claiming he was 
the best second-baseman that Baker ever had, and on it came: old 
Christmas parties; old secretaries; rush orders from Chrysler that 
had kept them hopping week-end nights; the big snowstorm in 
1952, and the time Ramsey and Paddleford backed a truck into 
forty drums and the drums went rolling every which way and 
Ramsey laughed and laughed; and the time the trucks got stuck 
in the yard during an ice storm and the drums started rolling off; 
Steve Toucek's fight with what's-his-name .... And on it went 
like that, stories that were partly true, maybe even mostly true. I 
knew then, watching the men drink and listening to each man 
tell his tale, that each of them had his own Baker, each had his 
own plant, and each had his own memories to last him the rest of 
this life. It had been a good place to work—for all of us (1969: 
319). 

Mental imagery was not the only way that people recorded the histories 
of the social system. Amateur photographers were ubiquitous in the parting 
ceremonies: we studied. The set of people included in each photograph 
appeared to be intentional. Usually at least one photograph included the 
entire group. And many subgroup photos were taken, that often captured 
functional, departmental, and informal structures of defunct organizations. 
As anyone knows who has spent an evening perusing family photo albums, 
photography is an exercise in preserving and editing history. 

In parting ceremonies, as in all phases of organizational death, members 
spend much time discussing why an organization died and who was to 
blame, in order to make sense of an important event in their lives. This is an 
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essential component of the cognitive summaries that displaced members 
leave with. Humans have a strong need to know and understand the causes 
of events, and research suggests (Nisbett & Ross, 1980) that humans seek to 
develop causal hypotheses about events from even the most meager clues, 
especially when the events are important to them. Parting ceremonies offer 
perhaps final opportunities to reaffirm or adjust hypotheses about the causes of 
an organizational death. For example, at the wake held for the academic 
unit, members reaffirmed the “evil” role of the parent organization. And at 
the last supper held by hospital employees, different subgroups developed 
and reaffirmed their different hypotheses about the demise. 

At least two of the organizational leaders in our research understood 
well the human predilection towards placing blame on individuals—particu- 
larly managers (Pfeffer, 1977)—Trather than on events (Ross, 1977). The man- 
ager of the large retail store and the acting president of the hospital both 
chose not to attend parting ceremonies because they feared confrontations 
with displaced workers who blamed them for the deaths. 

Finally, as Turner (1969) asserted, ceremonies do not always serve to 
reinforce static sets of relationships but can also mark shifts in those 
relationships. As Moore and Myerhoff observed: “Since ritual is a good form 
for conveying a message as if it were unquestionable, it often is used to 
communicate those very things which are most in doubt" (1977: 24). In all 
parting ceremonies we studied, people told one another “I guess this means 
it is really over." A parting ceremony provides unambiguous and vivid evi- 
dence of an organization's demise. This helps individuals avoid unhealthy 
denial of a closing, which is common in organizational death (Slote, 1969; 
Taber et al., 1979). 

Self-schemata. Self-schemata are cognitive generalizations about the self, 
derived from past experience, that organize and guide the processing of 
self-related information contained in an individual's social experiences 
(Markus, 1977). The concept of self is a function of the qualities and out- 
comes of an individual's experience. Telling stories about their experiences 
and noting the reactions of others allows displaced members to edit their 
histories as members of the dying organization. Displaced members may 
hear others tell stories about themselves; these stories may lead to affirma- 
tion or modification of self-histories. This storytelling also serves to adjust or 
reaffirm displaced members’ thoughts about qualities such as their ability to 
tell jokes, be creative, work hard, be lazy, or help others. 

It is important to self-schemata that the stories focus on individual actions. 
Organizational stories often contain morals (Bower, 1976; Martin, 1982). 
Stories that emerge in parting ceremonies seem to stress the moral that each 
individual played an important role in shaping the experiences that all shared. 

Parting ceremonies also provide opportunities for displaced members to 
reaffirm or question beliefs about their responsibility for their organization's 
demise. For example, a ceremony provided an opportunity for a manager of 
the retail store to remind others, and thus himself, that the organization died 
despite his fine work to reduce the store's losses. 


Taass: 
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The events of parting ceremonies allow the displaced members to vali- 
date and edit their self-schemata through reification of their previous organi- 
zational experiences. Moreover, such ceremonies serve as settings in which 
they can begin to interpret the implications of the closing event for self. Slote 
(1969) designated displaced members as either winners or losers. Parting 
ceremonies allow individuals to gather data that help them apply such labels 
to themselves. For example, one hospital administrator who received a pro- 
motion from the parent hospital reported being taunted by many of the other 
displaced members who had no jobs: "You've got a job to go back to, you 
know the hospital is gonna pick you up. Why can't the hospital assume all of 
us?" She responded to these remarks by telling us “I started to feel guilty 
about being employed." 

In literary terms, parting ceremonies help displaced members complete 
chapters in their mental autobiographies—chapters about their tenures in the 
defunct organizations. These resemble the abbreviated summaries of social 
encounters that Albert and Kessler (1976) described. 

Parting ceremonies also facilitate planning for the next chapters of these 
autobiographies. By talking with people who have already successfully made 
transitions from organizational membership to new roles, displaced mem- 
bers can cope with the poor self-images that may result from association 
with failure and can begin to plan and picture desired selves (Wicklund, 
1975). 

Event Schemata. Event schemata, or scripts, are like cartoon strips that 
summarize "basic actions that can be executed in a range of possible man- 
ners and contexts" (Nisbett & Ross, 1980). Event schemata help people decide 
what to do next in given situations. Upcoming or current role transitions 
evoked by organizational death may include retiring, starting a new job in a 
parent organization, starting a new job in another organization, looking for a 
new job, going to school, or collecting unemployment benefits. If, as Abelson 
argued, a script is “a coherent sequence of events expected by the individual" 
(1976: 34), then most people have incomplete, inappropriate, obsolete, or 
nonexistent event schemata to guide their behavior through these transitions. 
Extensive research suggests that people experience distress when they are 
unable to predict future events that are important to them (e.g., Seligman, 
1975; Sutton & Kahn, 1986). 

Parting ceremonies create settings in which displaced members can 
explore with each other their ideas for dealing with role transitions. The 
displaced members we studied spent much time talking to one another about 
the future. In a period of individual transition, networks of acquaintances 
can provide timely and important information about job possibilities, influ- 
ential people, or even information about how to sign up for unemployment 
benefits. Informants also reported that parting ceremonies were used to get 
letters of recommendation from former and current supervisors and phone 
numbers of “people you should talk to.” 

Former members who had already made transitions to new roles were 
invited to attend parting ceremonies. These former members spent much 
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time discussing their own transitions and their outcomes. They were often 
asked to judge the quality of coping strategies developed by members who 
had yet to make a transition. A typical question might be, “So what do you 
think about this approach to finding a job?" In short, parting ceremonies 
provide settings that can help displaced members anticipate the future and 
learn what actions are appropriate for that future. 


FUNCTIONS OF PARTING CEREMONIES FOR THE MANAGEMENT 
OF DYING ORGANIZATIONS AND FOR PARENT ORGANIZATIONS 


Our emphasis thus far has been on how parting ceremonies help individu- 
als cope with the displacement caused by organizational death. We assert 
that, in addition to the benefits they offer displaced members, parting ceremo- 
nies may also serve functions for the managements of dying organizations and 
for their parent organizations, if such exist. The theoretical foundations of 
this assertion are less well developed than those for the individual functions. 
It is important that we describe some of these functions, however, because so 
little is known about the management of organizational death (Sutton, 1983). 

We will focus on how management can influence parting ceremonies to 
enhance such functions. However, it is essential to distinguish functions 
from managerial actions. Managers may seek to create ceremonies that 
enhance outcomes they desire, but links between organizational changes 
and managerial intentions can be remarkably weak (March, 1981). Moreover, 
it may not occur to some managers that parting ceremonies can be points of 
intervention. As mentioned earlier, the president of the dying hospital was 
horrified to learn that employees were “celebrating” the demise. Nonetheless, 
some leaders do appear skilled at orchestrating such ceremonies. The head 
of the center for the disabled, for example, used Recognition Day and the 
picnic to help maintain employee commitment and effort. And such actions 
were taken with skill and forethought. 

Specifically, we propose that skilled managers can use parting ceremo- 
nies in order to influence five functions: (1) member motivation, (2) informa- 
tion dissemination, (3) external stakeholders’ acceptance, (4) impression 
management, and (5) guilt assuagement. 


Member Motivation 


Parting ceremonies, particularly those that managements plan and 
sponsor, may satisfy organizational and societal norms about the treatment 
of members. Sponsoring a parting ceremony requires resources, particularly 
time. Employees know that resources are usually scarce during organiza- 
tional death. According to the norm of reciprocity (Blau, 1964), when man- 
agement takes time to plan a ceremony, employees may put forth greater 
effort to complete remaining tasks. Parting ceremonies are also often used to 
recognize people who have been important to an organization. Since the 
organization's formal reward base—job security, advancement, and so forth—is 
often eroded, a ceremony allows management to give a collective pat on the 
back to lower level employees. 
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Perhaps more important, parting ceremonies provide management with 
settings in which they can smooth over or resolve conflicts with employees. 
Management can use the occasion to break down the barriers that separate 
them from employees and promote the image of a shared fate. It is important ` 
that management be able to come across as less villainous and less well-off 
than they appear to be to employees who may feel betrayed or used. Restor- 
ing good will is a crucial step in insuring that employees will continue to do 
their jobs and will not engage in counterproductive behaviors such as sabo- 
tage or theft. 


Information Dissemination 


Parting ceremonies facilitate the sharing of information. Since usually 
most members are present, management can send the same message to all 
through memos or announcements. At the picnic held at the center for the 
disabled, for example, a handout was passed out that described how to get 
unemployment benefits. A more subtle way in which parting ceremonies 
facilitate the dissemination of information was mentioned earlier. A 
ceremony serves as a symbolic way of saying a closing is a reality. 


External Stakeholder Acceptance 


Parting ceremonies also make explicit to other stakeholders that a clos- 
ing is truly going to occur. Stakeholders may be invited to parting ceremonies. 
For example, when parents of residents attended Recognition Day at the 
center for the disabled, it forced them to face hard evidence that the center 
was going to close. Students from the academic unit attended the wake, 
which had a similar effect. Further, when members of other stakeholding 
groups get indirect word of a parting ceremony, it helps them understand 
that a closing is actually going to occur. For example, when suppliers of the 
large auto plant heard that 900 employees had held a party to say goodbye to 
one another, it affirmed that the plant was going to close. Thus, parting 
ceremonies can have a ripple effect that enhances sense-making among other 
stakeholders. 


Impression Management 


When dying organizations are subunits of larger organizations, manage- 
ment can use parting ceremonies to create favorable impressions among 
external stakeholders. Providing employees with a nice picnic and giving 
them certificates of merit, as the center for the disabled did, signaled other 
stakeholders that the center was breaking links in ways that were appropri- 
ate to organizational and societal norms. Breaking links in a normative way, 
or in ways that are more generous than existing norms suggest, is important 
for parent organizations that depend on support from customers or clients 
for survival. In this sense, parting ceremonies become stages on which man- 
agement can further their interests by presenting “administrative dramas" 
that convey messages favorable to them (Meyer, 1982: 533). 
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Guilt Assuagement 


Many of the managers with whom we spoke felt guilty about the closing 
and its negative effects on displaced members. The head of the center for the 
disabled was very upset because almost 70 percent of those whom he had 
helped lay off were collecting unemployment insurance. He also felt guilty 
because the center had not taken enough steps to recognize individual 
employees. Similarly, Slote (1969) reported that the guilt of closing the Baker 
plant led Frank Robertson, the last plant manager, to experience severe physi- 
cal distress. 

Impression management refers to influencing the evaluations of outsiders, 
but assuaging guilt refers to the internal reactions of management to parting 
ceremonies. Sponsoring formal ceremonies in which they can buy drinks, 
buy food, offer thanks, or even offer letters of recommendation for displaced 
employees can help managers reestablish a sense of equity and therefore feel 
less guilt. 


DISCUSSION 


Parting ceremonies that arise during organizational death have been 
described as coping mechanisms for displaced members and as points of 
intervention that leaders can use to manage organizational death. We have 
proposed that parting ceremonies help members cope with the affective 
demands of organizational death by creating settings in which members can 
provide each other with emotional support. We have also proposed that 
these ceremonies help members cope with the cognitive demands of organi- 
zational death because they facilitate the editing of social system, self-, and 
event schemata. Finally, we have described five functions that parting cere- 
monies can serve for managements of dying organizations and their parents. 

These propositions require qualification, expansion, and testing. Our 
aim was to build a data-based theory. Glaser and Strauss (1967) asserted that 
theories grounded in empirical evidence should be superior to those grounded 
only in the imagination. Despite its empirical grounding, the theory offered 
here is not complete. We contend, however, that it provides a useful point of 
departure for subsequent research and theory-building efforts. 

Perhaps the theory’s most evident limitation is the lack of attention it 
gives to negative or dysfunctional aspects of parting ceremonies. We implied 
earlier that some people benefit more from parting ceremonies than others. 
Although placing the blame for a closing on others may complete mental 
pictures for some, the impact on those blamed is likely to be negative. In 
parting ceremonies, as in organizational death itself, there appear to be win- 
ners and losers. 

Why did this study lead us to such a strong emphasis on positive 
consequences? This emphasis may reflect a systematic effort by decision 
makers to justify their actions rather than a ubiquitous quality of parting 
ceremonies. Yet this potential source of distortion is attenuated in this study 
because decision makers often had little stake in the gatherings; people 
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outside of the managerial ranks instigated at least 6 of the 11 ceremonies. 
Moreover, nonmanagers like union leaders and secretaries also emphasized 
the positive consequences of parting ceremonies. 

Borrowing Gestalt terminology, we could say that the positive conse- 
quences of parting ceremonies stand out against the negative background 
created by the death of an organization. The negative consequences of part- 
ing ceremonies, on the other hand, may blend into this background and not 
register as significant. To more fully capture the complexity of parting 
ceremonies, future research and theory-building efforts need to focus on the 
interaction between the negative and positive consequences such ceremo- 
nies have for individuals and managements of dying organizations. 

Another limiting feature of the theory offered here is its neglect of varia- 
tions among parting ceremonies. As Udy (1965) observed, theories about 
organizations can be divided into those emphasizing variables and those 
emphasizing constants. Our focus on constants facilitated building a theory 
that captures the basic structure of parting ceremonies and provides initial 
support for the pervasiveness of these phenomena. It is difficult to examine 
features that differ across settings without an idea of how the settings are 
similar. Thus, the theory offered here may help guide research on the differ- 
ences among parting ceremonies. 

Subsequent theory-building might also address differences between 
organizational deaths in which parting ceremonies occur and organizational 
deaths in which they do not occur. We observed no parting ceremonies in 
two of the eight organizational deaths we studied—the small auto plant and 
small sporting goods store. The qualifier “small” in the description of these 
two organizations is telling. Perhaps efficiency is one reason that formal 
parting ceremonies arise—they provide settings in which most or all mem- 
bers of dying organizations can see one another for the last time. In the small 
retail store, the three members saw one another almost every minute of the 
working day and could handle parting informally. Perhaps formal parting 
ceremonies only arise when there is a need to supplement informal meetings. 
Similarly, in their discussion of ending social encounters, Albert and Kessler 
(1976) asserted that group rituals arise when individual summaries are too 
difficult to develop and when a group ritual is more efficient. 

The absence of a parting ceremony in the small auto plant may have 
been due to smallness; however, other qualities of this organizational death 
may have eliminated the forces that give rise to such ceremonies. Parting 
ceremonies seem to arise to address the shared needs of members who have 
been or are about to be displaced. The small auto plant's death was gradual, 
and no one ever acknowledged that the organization was dying. This gradual 
process may have eliminated the feeling of shared fate that gives rise to 
parting ceremonies. In addition, the owner of the plant engaged in denial 
about the impending fact of organizational death. Perhaps a parting cere- 
mony would have helped him and other members accept the demise of this 
organization. 
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It is also possible that no parting ceremonies emerged in these two 
organizations because of some feature inherent in private sector organiza- 
tions that are independent of larger organizations. As Table 1 reveals, the 
small sporting goods store and the small auto plant were the only two organi- 
zations studied that fit this category. Perhaps parting ceremonies failed to 
arise because there was no need to satisfy the norms of a parent organization 
or a public agency. The present study does not allow us to determine which 
if any of these possibilities explain the absence of parting ceremonies. Future 
research should focus on the emergence of parting ceremonies and the differ- 
ences in circumstances and outcomes between organizational deaths that 
include parting ceremonies and those that do not. 

More broadly, this study suggests that organizational scholars should 
ask how formal occasions, as a class of ceremonies, can enhance organiza- 
tional outcomes. Some anecdotes suggest that effective companies use par- 
ties to socialize and gain loyalty from members. Tandem’s Friday “beer 
busts” are one example (Deal & Kennedy, 1982). 

We need more evidence about the link between ceremonies and organiza- 
tional outcomes. Some companies spend huge sums on ceremonies. During 
the Christmas of 1984, for example, Advanced Micro Devices spent $700,000 
on a single party for 10,000 guests, and Apple Computers spent over $110,000 
for 19 parties (Bellew, 1984). Is such money wasted? Or, as executives from 
these companies claim, are such extravaganzas essential for attracting the 
best talent, maintaining commitment, and keeping the best employees from 
jumping ship? 

The research described here and the theory that follows from it suggest 
that formal ceremonies play an important role in organizational death and in 
other domains of organizational life. We have argued that parting ceremo- 
nies have benefits for displaced members, for management, and for external 
stakeholders. This perspective suggests that subsequent theory-building and 
testing should explore the opportunities and hazards that formal ceremonies 
hold for the well-being of organizations and their members. Only in this way 
can we begin to assess the psychological and economic value of Friday beer 
busts, Christmas parties, sales conventions, and annual company picnics. 
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In 22 municipal bus companies, the extent of union members’ dual 
commitment—to their unions and to their employing organizations— 
covaried with interorganizational differences in labor-management 
climates. Dual commitment was higher in cooperative climates than in 
less cooperative ones. The extent of members’ participation in the union 
process moderated this relationship. It was relatively strong for more 
active participants and relatively weak for the less active. Separate 
organizational and union commitments also covaried with labor- 
management relationship climate but in a less monotonic manner than 
dual commitment did. 


Can unionized employees develop simultaneous commitments to their 
unions and their employing organizations, or is there an inherent conflict of 
allegiance in such dual-membership situations that forces an either-or choice? 
This was the essential question that formed the basis for a flurry of research 
on dual loyalty or dual allegiance some 25 years ago. That research left the 
issue largely unresolved. This study once again analyzes the dual commit- 
ment concept in relation to two variables: (1) ambient labor-management 
relationship climate, and (2) extent of members’ participation in union 
activities. This study is the first to use independent sources to measure 
labor-management relationship climates and dual commitment. 


THEORETICAL FRAMEWORK 


During the 1950s and early 1960s, an apparent concern that the rapid 
growth of the unionization might compete with employees’ commitment to 
their employing organizations stimulated much research on dual allegiance 
or dual commitment (Gallagher, 1984). Early work by such scholars as Dean 
(1954) and Purcell (1960) indicated that most workers show allegiance to 
both union and management. Dean’s analysis of three unionized plants incor- 
porated an extended period of observation and interviews with union and 
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management officials and rank-and-file members, followed by a question- 
naire survey. In all three plants she found that those workers who were 
most favorable toward management also tended to be the most likely to give 
their union leaders high ratings. However, as Dean acknowledged, it was 
unfortunate that she asked a direct question about dual loyalty in only one of 
the plants. Purcell’s landmark study, involving a single unionized meat pack- 
ing plant in Chicago, was even more restricted. As Dean did with most of her 
data, Purcell assessed dual allegiance by tallying proportions of workers 
whose responses to questions tended to be favorable toward both the com- 
pany and the union. 

Other scholars of the period, including Barkin (1950), England (1960), 
and Kornhauser, Sheppard, and Mayer (1956) argued against workers’ ability 
to display simultaneous loyalty to both systems. Barkin, for example, asserted 
that the notion that employees can maintain loyalty to companies and to 
unions concurrently "does not stand up under even a cursory examination" 
(1950: 64). 

After about 1960, the issue of dual allegiance lay essentially unresolved 
and dormant for about a quarter of a century, until a new generation of 
researchers raised it again. In the intervening period, a strong research theme 
developed around the concept of organizational commitment (Porter, Steers, 
Mowday, & Boulian, 1974)—a concept having obvious relevance to dual 
allegiance. Indeed, it has been central to recent research on dual allegiance 
(Fukami & Larson, 1982, 1984; Gallagher, 1984; Martin, 1981; Martin, 
Mageneau, & Peterson, 1982). 

Reminiscent of the earlier work of Dean (1954) and Purcell (1960), such 
recent research has tended to measure commitment to employing organiza- 
tions and commitment to unions separately and has then combined these 
measures to show the extent of dual commitment. For instance, Gallagher 
(1984) tallied employees showing strong or moderate positive responses to 
separate union commitment and organizational commitment scales and used 
the proportion that fell into these classifications on both scales as an index of 
dual commitment. In Martin’s (1981) study of a federal government facility, 
questionnaires that included scales measuring attitudes toward unions and 
toward management’ were administered to union members and nonmembers. 
This research assessed dual allegiance by recording the proportion of employ- 
ees whose responses were favorable toward both referents. Fukami and Lar- 
son (1982) used the correlation between employees’ scores on union commit- 
ment and organizational commitment to measure dual commitment. 

/ Commitment has been anything but a consensual construct in the gen- 
eral literature. It has been described in such diverse ways as: attachment to 
and identification with an organization (Buchanan, 1974); a binding of atti- 
tude and belief to prior behaviors (Salancik, 1977); a resignation to a status, 
under penalty of forfeiture of the costs of attainment of that status (Becker, 
1960); and a dependency relationship in which maintenance of an indivi- 


1 These scales were adapted from Stagner, Chalmers, and Derber (1958). 
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dual’s "internal being requires behavior that supports the social order" 
(Kanter, 1972: 66). Such diverse conceptual frameworks and their operational 
definitions—including several variations on why and how people become 
committed and how to measure commitment (Angle & Perry, 1983; Kiesler, 
1971; Ritzer & Trice, 1969; Stevens, Beyer, & Trice, 1978)—-have created a 
situation in which it is often difficult to make direct comparisons among 
commitment studies. 

Diversity in conceptualizations and measurements of commitment has 
had clear effects on the coherence of recent research on dual commitment. 
For example, although both Gallagher (1984) and Fukami and Larson (1982) 
measured dual commitment by combining scores on separate measures of 
commitment to union and organization, they measured commitment in 
different ways. Gallagher employed a scale based on the work of Porter and 
colleagues (1974), but Fukami and Larson adopted a measure patterned after 
the work of Ritzer and Trice (1969). These two scales rest on quite different 
conceptualizations of commitment and tend to be only moderately corre- 
lated (Ferris & Aranya, 1983), making direct comparison problematic. The 
underlying distinction between these conceptualizations regards the extent 
to which an affective, as opposed to a calculative, attachment is presumed. 
This duality resembles Etzioni’s (1975) distinction between moral and calcu- 
lative involvement. The instrument developed by Porter and colleagues em- 
phasizes members’ moral involvement, but Ritzer and Trice’s approach ap- 
pears to have a more nearly calculative involvement as its premise (Ferris 
& Aranya, 1983). 

In calculative involvement, bonding between member and social system 
results from a committed individual’s linking the relationship to extrinsic 
outcomes on the basis of costs and benefits. Becker (1960), for example, 
defined commitment in terms of aversion to bearing costs of forfeiture. This 
refers to losses of investments or side bets that would be incurred in severing 
a relationship. Such an instrumental orientation seems incongruent with 
common definitions of allegiance or loyalty, which usually connote an affec- 
tive attachment or sense of devotion. 

The perspective on organizational commitment taken by Porter and his 
colleagues (Mowday, Steers, & Porter, 1979, 1982; Porter et al., 1974) comes 
closer to such notions of loyalty and allegiance. This organizational behavior 
approach (Staw, 1977) essentially treats commitment in terms of individuals’ 
psychological attachments to social systems. According to Porter and his 
colleagues, a committed employee: (1) has a strong desire to remain a mem- 
ber of his or her organization; (2) internalizes the values and goals of that 
organization; and (3) is willing to work extra hard on behalf of the organiza- 
tion. Thus, attachment to membership derives not from economic exchange, 
but from such processes as identification and internalization (Kelman, 
1958)— processes more clearly consonant with loyalty or allegiance. 

A great deal of published empirical research across a range of organiza- 
tional and employee types has employed the measurement procedure associ- 
ated with this approach to commitment (Mowday et al., 1979). Thus there is a 
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substantial amount of normative and psychometric information regarding 
the measure. Furthermore, a major research effort on commitment to unions 
appears to have its roots in this framework (Gordon, Philpot, Burt, Thompson, 
& Spiller, 1980; Ladd, Gordon, Beauvais, & Morgan, 1982). 

In addition to studies of organizational commitment, the period between 
the two waves of research on dual allegiance or dual commitment has seen at 
least two other streams of research develop that appear clearly relevant to 
the issue. One such research domain is role conflict (Kahn, Wolfe, Quinn, 
Snoek, & Rosenthal, 1964; Rizzo, House, & Lirtzman, 1970). A particularly 
relevant form of role conflict in the present context would appear to be 
interrole conflict, in which an individual simultaneously occupies two or 
more roles that may make incompatible demands; the roles of union member 
and organization member are an example. Lieberman’s (1956) famous study 
of attitude shifts illustrated how group membership can affect attitudes and 
values and in particular how such values differ between union and manage- 
ment identifications. He studied individuals as they changed their primary 
roles, moving back and forth between being workers and either union stew- 
ards or foremen. 

The other stream of research involves a distinction between cosmopoli- 
tans and locals. This concept concerns the dilemmas facing professionals 
whose values and loyalties to organizations and professions sometimes col- 
lide (Gouldner, 1957; Miller & Wager, 1971). In a classic study of the occupa- 
tion of personnel manager, Ritzer and Trice (1969) illustrated that the issue 
of dual allegiance is not limited to labor-management relations, but also 
clearly applies to organizational versus occupational commitments. 

Gouldner (1957) and other early researchers saw cosmopolitanism and 
localism as decidedly zero-sum. Cosmopolitanism and localism appeared to 
be antithetical; strong attachment to a profession precluded attachment to an 
organization and vice versa. However, more recent research has suggested 
that an either-or choice is not inherent in these orientations. Ritzer and Trice 
(1969) found personnel managers to be committed to both their occupation 
and their employing organizations "and only slightly more to their occu- 
pation" (1969: 33). Miller and Wager (1971) held that the two orientations 
need not be mutually exclusive when the expectations and role demands of 
profession and organization remain congruent—if, for example, an organiza- 
tion does not demand behavior contrary to professional norms. Accordingly, 
cosmopolitanism and localism need not be viewed as incompatible. Rather, 
they may coexist, contingent perhaps upon the extent to which individuals 
can avoid choice dilemmas or fundamental role conflicts. 

These two concepts, role conflict and cosmopolitanism versus localism, 
are obviously relevant to the study of labor-management relations to the 
extent that unions and employing organizations force members to choose 
between incompatible values or make incompatible behavioral demands. 
There is substantial evidence that various union-management interfaces dif- 
fer systematically with respect to how compatible or incompatible the per- 
ceived orientations of the two parties are. Scholars have long held that they 
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can characterize labor-management relations by the relative amounts of ambi- 
ent conflict or cooperation that reside in the two parties’ orientations toward 
one another. This background condition has been called attitudinal climate 
(Chalmers, Chandler, McQuitty, Stagner, Wray, & Derber, 1954), models of 
labor management relations (Harbison & Coleman, 1951), or, perhaps most 
commonly, labor-management relationship patterns (Walton & McKersie, 
1965). Whatever the terminology, the underlying idea has been quite 
consistent. As labor and management interact in a given setting over time, 
each develops a somewhat stable frame of reference regarding the orienta- 
tions of the other party; in effect, this frame of reference describes where the 
fundamental relationship lies along a cooperation-conflict continuum. Within 
such frames of reference the parties engage in sensemaking behaviors— 
interpreting the meanings of otherwise ambiguous events and making attribu- 
tions (Shaver, 1975) regarding the intentions, motivations, and beliefs that 
underlie the other party's actions. 

It appears reasonable to expect that, whatever other variables might 
influence the likelihood of dual commitment, the ambient relationship cli- 
mate will be a factor. Social-psychological theories of cognitive consistency 
predict, for example, that individuals who try to be loyal to two conflicting 
social systems will encounter considerable cognitive dissonance (Festinger, 
1957). Indeed, both the old and the new research on dual allegiance or dual 
commitment contain this premise. Gottlieb and Kerr (1950), in their survey 
of the Buchsbaum Company, concluded that the high correlation between 
attitudes toward management and toward the union was associated with a 
62-year history that was devoid of union-management hostility. Rosen (1954) 
also suggested that dual allegiance is most likely under circumstances in 
which no intense role conflict is activated. Furthermore, although Dean 
(1954) concluded that “there is little... that precludes dual loyalty, even 
where the union-management relationship is antagonistic" (1954: 536), the 
patterns she reported in her comparison of three plants appeared to indicate 
that participation in union activities and dual allegiance were more apt to 
occur together in the plant whose climate she designated cooperative than in 
the plants whose relationships were characterized by conflict. Her observa- 
tion suggests that the intensity of members' personal involvement in a union 
might moderate any relationship between dual loyalty and labor-management 
relationship climate. 

In more recent research, Martin (1981), using Walker and Lawler's (1979) 
classification scheme, found that dual allegiance was typical of union mem- 
bers in protective unions, and unilateral allegiance was typical in aggressive 
unions. Fukami and Larson (1982) found a significant correlation between 
commitment to union and to company (r= .27, p<.001). However, when they 
partialled out employees' perceptions of the union-management relationship, 
the correlation fell to a nonsignificant .06. Gallagher (1984) reported a sim- 
ilar finding; when he controlled for individual perceptions of the quality of 
the union-management relationship, positive and statistically significant rela- 
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tionships between union members’ organizational commitment and union 
commitment became nonsignificant. 

By and large, the research we have reviewed—particularly the new wave 
of research—tends to lead to the following conclusions: (1} dual commitment 
is a relatively common phenomenon—70 percent of Gallagher’s (1984) union 
members were so classified; (2) the ambient labor-management relationship 
climate might explain much of the variance in dual commitment. However, 
both the old and the new studies have been vulnerable to certain response 
artifacts. It has been the norm to assess dual commitment by counting the 
numbers of people whose attitudes were positive toward both union and 
management. Furthermore, in each of the studies reported earlier in this 
paper, both the commitment measures and the judgments of the quality of 
the overall union-management relationship derived from the same self-reports. 
Thus, yea-saying response bias could inflate both the extent of dual commit- 
ment per se and observed covariation in measures of relationship climate 
and commitment. 

Overcoming such artifacts necessitates deriving a direct measure of dual 
commitment. It also requires using independent sources to measure: (1) 
labor-management relationship climate, and (2) employee commitments. Pro- 
viding an opportunity for variance to occur in a relationship-climate mea- 
sure mandates conducting a study across several independent units or 
organizations rather than in a single organization that would theoretically 
have a single climate. 

The research reported here investigated the relationships between ambi- 
ent labor-management relationship climates and both dual and separate mem- 
ber commitments to employing organizations and unions. In addition, we 
assessed the moderating influence of member participation in union activi- 
ties on this relationship. We included this aspect in view of Dean’s (1954) 
observation that dual loyalty and active participation in union activities 
tended to co-occur where an overall relationship was relatively cooperative. 

Hypotheses were as follows: 


Hypothesis 1: Relatively high levels of dual commitment 
will occur in organizations whose ambient labor-manage- 
ment relationship climates are relatively cooperative. 


Hypothesis 2: The relative level of members participa- 
tion in their unions' activities will moderate the relation- 
ship between climate and dual commitment. This rela- 
tionship will be stronger under conditions of high member 
participation than under low participation. 


In addition to investigating the hypothesized relationships, an explor- 
atory analysis investigated the covariance between the labor-management 
relationship climate and sole commitments to organization or union. However, 
we established no directional hypotheses concerning sole commitments. 
Although observers often note that conflict with an out-group polarizes 
in-group members and increases their cohesiveness, in this instance the 
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members in question were not members of one system alone but were attached 
to both groups. Thus our treatment remained exploratory. 


METHODS 
Setting and Participants 


This research was part of a larger study of labor-management relations 
and organizational performance in 28 separate municipal bus companies in 
the western United States. We gained access to each organization by corres- 
ponding with its general manager. We obtained access to union members 
through separate negotiations with the leadership of the appropriate unions. 

Teams of two researchers made two-day long visits to sites in the sum- 
mer of 1977. We administered separate-form questionnaires to: (1) transit 
organization managers who were involved in union-management relations; 
(2) their opposite-number labor leaders; and (3) rank-and-file employees 
who were members of the bus drivers’ bargaining unit. As we were able to 
administer the rank-and-file questionnaires in only 22 of the 28 organizations, 
the organizational N for purposes of this article is 22. 

With the number of bus drivers employed serving as an indicator, organ- 
izations varied in size from 13 to 4,940, with a median size of 58. When 
maintenance personnel, who were normally in the drivers’ bargaining unit 
and thus included in the survey, are taken into account, median organization 
size was 69. 

Eighteen of the organizations were unionized, 11 in transit unions and 
seven in general unions; drivers in the remaining four were represented in 
collective bargaining by local employee associations that were functional 
equivalents of unions. Although only 12 of the organizations had union 
shops and three others were agency shops, 85 percent of the respondents to 
the rank-and-file survey were members of unions or associations. During 
their most recent rounds of negotiations, strike votes had occurred in eight of 
the organizations and the membership had supported strikes in seven; 
however, only one organization had seen an actual strike. 

Respondents either hand-delivered their anonymous completed ques- 
tionnaires to us or mailed them in prepaid mailer envelopes. Response rates 
and usable response Ns were 84 percent (N —67) for managers, 73 percent 
(N=57) for labor leaders, and 64 percent (N=1,244) for rank-and-file 
employees. Only members of the union or employee association in each 
organization answered questions on dual and union commitment. This re- 
duced the rank-and-file N for analyses involving union commitment or dual 
commitment to 1,057. 

Bus drivers accounted for 89 percent of the rank-and-file sample; 6 per- 
cent were driver supervisors or bus maintenance personnel; the remaining 5 
percent were clerical and related personnel. Rank-and-file respondents were 
82 percent men, 74 percent white, 14 percent black, and 10 percent Hispanic. 
Mean age was about 38, with 28 percent younger than 30 and 18 percent 
older than 50. Eighteen percent had worked less than two years in their 


38 Academy of Management Journal March 


present organizations, and 30 percent had been employed there longer than 
five years. 


Measures 


Each question in the scales listed in this section had a 7-point Likert 
agree/disagree format. We incorporated scales as appropriate in question- 
naires administered to transit managers, labor leaders, and rank-and-file 
employees. The Appendix contains all items for these scales, except for the 
Organizational Commitment Questionnaire, which has been fully described 
elsewhere (Mowday et al., 1979). 


Manager and Labor Leader Questionnaires 


Labor-management relationship climate. The ambient labor-management 
relationship climate for each organization was measured by a 23-item scale. 
We adapted most of the items from the University of Michigan Organiza- 
tional Assessment Package (Institute for Social Research, 1975). Parallel forms 
were developed so that each side responded to questions about their percep- 
tions of their opposite numbers. Reliabilities for the scales (a) were .92 for 
managers and .95 for labor leaders. 


Employee Questionnaire 


Dual commitment. A 5-item Likert scale measured dual commitments to 
employing organizations and to unions or employee associations (a — .71). 

Commitment to organization. The 15-item Organizational Commitment 
Questionnaire (OCQ) (Mowday et al., 1979) measured rank-and-file employees’ 
commitments to their employing organizations (a =.89). 

Commitment to union. A 4-item scale measured members' commitments 
to unions or employee associations (a = .76). 

Union participation. We asked union or association members eight yes 
or no questions regarding participation in various union activities in order to 
derive their relative levels of union participation. Each employee's union 
participation score was a simple sum of yes answers. 


Analysis 


Including several separate organizations in the research and employing 
separate questionnaires for managers, labor leaders, and rank-and-file union 
members enabled use of independent data sources for measuring indepen- 
dent and dependent variables. We assessed labor-management relationship 
climates by means of the managers' and union leaders' questionnaires. Dual 
commitment, sole commitments to union or employing organization, and 
union participation were measured through questionnaires administered to 
the rank-and-file. This approach eliminated the problem of single-source 
contamination of independent and dependent variables for all analyses except 
those relating union participation to commitment. 

Administration of transit-manager questionnaires was limited to general 
managers and other managers who had been directly involved in labor- 
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management relations. We gave the labor-leader questionnaires to their coun- 
terparts on the union side. Accordingly, the measure of each organization's 
ambient labor-management relationship climate derived from reports of per- 
sons best able to make that assessment. This procedure resulted in our obtain- 
ing from 4 to 11 usable returns per organization. 

To derive scores for labor-management relationship climate for each 
organization, we first aggregated the scores within each organization for (1) 
labor-leaders and (2) managers. We then assigned the mean of these two 
scores as an organization's score on labor-management relationship climate. 
All rank-and-file members then received one of 22 climate scores that were 
based on the consensus of the labor leaders and managers from their own 
organizations. 

Constructing organization-level variables by aggregating individual per- 
ceptions is problematic, but under the right conditions this approach can 
yield valid indices of organizational properties (Lincoln & Zeitz, 1980). In 
order to justify our assigning a single climate score to each organization, we 
considered it essential that there be high degrees of interrater agreement 
within each organization. In particular, union and management sources 
should report highly similar perceptions of the labor-management climate at 
given locations. Historically, however, the evidence for such consensus has 
been mixed. Rim and Mannheim (1964) found a significant correlation (r = 
.30, p < .05) between selected labor-management attitudes of general manag- 
ers and union representatives. But Stagner, Chalmers, and Derber (1958) 
concluded from the Illini City data that relationships above the level of the 
rank-and-file are too complex to permit consensus regarding a labor- 
management relationship climate. 

In the present research, the Pearson product-moment correlation of these 
pairs of scores for the 22 organizations was .84 (p « .001). To ascertain 
whether this ostensible agreement might have resulted from a washing out of 
within-group variance through the averaging process, we conducted a one- 
way analysis of variance across the 22 organizations, with the identities of 
the organizations involved treated as the categorical variable. The individual 
scores of all labor and management respondents comprised the dependent 
variable. The F-ratio obtained was 6.25 (p « .0001). We repeated this analysis 
twice: once using only labor leaders' scores, and once using only managers' 
scores. Again, in each instance, the F-ratio was statistically significant 
(p < .001). 

Hierarchical regression analysis was performed in order to assess the 
relationship between the labor-management climate and dual commitment 
and to test for the moderating influence of union participation. We followed 
Arnold's (1982) procedural recommendations and used dual commitment as 
the dependent variable. This procedure was designed not only to measure 
each independent variable's unique influences on dual commitment, but 
also to test for the presence of a true moderator variable that altered the form 
of the relationship between dependent and independent variables. As we 
had hypothesized that the main effect would be the relationship between 
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climate and dual commitment, and that union participation would moderate 
this effect, we entered three independent variables into the regression in the 
following order: (1) labor-management climate, (2) union participation, and 
(3) a multiplicative interaction term (climate x participation). 


RESULTS 


Table 1 presents descriptive statistics and the intercorrelation matrix for 
the variables of interest in this research. For the overall sample, the correla- 
tion between labor-management relationship climate and dual commitment 
was .32 (p < .001). Although, not surprisingly, the correlation between union 
participation and commitment to a union was statistically significant (r = 
.22, p < .001), the correlations between union participation and dual commit- 
ment and between union participation and organizational commitment were 
approximately zero. 

There was no apparent relationship between labor-management relation- 
ship climate and recent strike-related activity. The one organization that had 
experienced a strike was near the median of the labor-management climate 
rank ordering, and the seven that had voted to strike were distributed nearly 
evenly across three-quarters of the range of scores. 

Figure 1 portrays the overall relationship between labor-management 
climate and dual commitment. Each data point represents one organization, 
plotted at its members' mean level of dual commitment on the ordinate and 
at the organization's score for labor-management relationship climate along 


TABLE 1 
Descriptive Statistics and Intercorrelation Matrix 
for Commitment, Climate, and Participation? 





Variables Means s.d. 1 2 3 4 
1. Dual commitment 4.39 1.21 
2. Union commitment 4.65 1.27 -40*** 
(1,035) 


3. Organizational 
commitment 4.50 1.12 .48*** .30*** 
(1,051) (1,052) 
4. Labor-management 


relationship climate 4.53 0.73 .92*** .09** .19*** 
(1,057) (1,056) (1,205) 
5. Union participation 2.48 1.20 .03 :22*** —.02 .07* 


(1,038) (1,027) (1,076) (1,098) 





a Figures below each correlation coefficient in parentheses indicate N for the correlation. 
> Organizational commitment measure included employees who were not union members. 
* Mean and s.d. are based on 1205 cases, each of which was assigned one of 22 possible labor- 
management climate scores. These scores were derived from 155 responses of managers and 
labor leaders; the mean of that distribution was 4.48, with a standard deviation of 1.10. 
* p<.05 
** p«.01 
*** p«.001 
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FIGURE 1 
Scatterplot of the Labor-Management Relationship 
Climate and Dual Commitment? 


Dual commitment 





3.0 4.5 6.0 
Labor-management relationship climate 


è Each plotted point is the average pair of scores for one organization. 


the abscissa. The correlation between dual commitment and climate, with 
organizations as the units of analysis, is .73 (p «.001). 

Table 2 summarizes the results of the moderated regression analysis. 
The labor-management relationship climate explains about 10.3 percent of 
the variance in dual commitment (p « .001). When union participation is 
entered next, there is virtually no change in R? and the F-to-enter is a nonsig- 
nificant 0.1. Finally, when the interaction term, climate X participation, is 
entered, explained variance climbs to 13.4 percent (p « .001). Thus, union 
participation appears to meet the test of a true moderator variable; it accounts 
for a significant increment in variance explained in interaction with the 
designated independent variable, labor-management relationship climate 
(Arnold, 1982). 
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TABLE 2 
Hierarchical Multiple Regression Testing 
the Moderating Effect of Union Participation on Dual Commitment 





Steps/Variables Entered B F p R? Change Cumulative R? 
1. Labor-management 
relationship climate .535 118.4 .000 103 .103 
2. Participation .011 0.1 n.s. .000 .103 


3. Interaction term 
(1 x 2) .054 36.4 .000 .031 .134 





* F to enter or remove. 


The effect of the interaction term, which had a positive regression 
coefficient, was to amplify the relationship between the labor-management 
relationship climate and dual commitment; the relationship covaried with 
activity. To illustrate this effect, we performed extreme-groups regressions 
for subsamples one standard deviation above and one standard deviation 
below the mean for union participation. Whereas the regression coefficient 
for climate on dual commitment had been .535 for the entire sample, it 
increased to .603 for the high-participation subgroup and decreased to .446 
for the low-participation subgroup. 

Because there were large variations in organizations' sizes, we were 
concerned that either the labor-management relationship climate or its rela- 
tionship with dual commitment might somehow be a function of size. 
However, the Pearson correlation between size and labor-management cli- 
mate was only -.03 (n.s.). Furthermore, the Pearson correlation between dual 
commitment and the labor-management relationship climate (r — .32) was 
not changed by partialling out organization size. 

In order to illustrate the monotonic nature of the relationship between 
the labor-management relationship climate and dual commitment, a subsid- 
iary analysis was conducted. We divided the sample of organizations into 
subgroups and performed a one-way analysis of variance with dual commit- 
ment as the dependent variable. For this purpose we arbitrarily selected five 
climate categories, though the point could be illustrated with fewer or more 
categories.? We partitioned the rank-and-file sample into categories by divid- 
ing it at the 20th, 40th, 60th and 80th percentile points of the distribution for 
labor-management relationship climate. 

Table 3 illustrates the results of that analysis. Dual commitment was 
strongest in organizations and bargaining units that coexisted in the most 
cooperative relationship climate and weakest in the least cooperative climate, 
with a nearly linear progression in between. However, only two of the four 
intercell contrasts were statistically significant. 


? Walton and McKersie's (1965) well-known scheme contains five categories, although we 
make no claim that our subgroups accurately reflect their classifications. 
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l TABLE 3 
ANOVA Results: Dual Commitment and Sole Commitments 
to Union and Management with Labor-Management 
Relationship Climate as the Categorical Variable 





Independent Variables Category (N=)? Means F Cell Contrasts 
Dual commitment 1 (142) 4.83 30.72*** 
2 (333) 4.66 1-2 
3 (321) 4.41 2—3** 
4( 32) 4.23 3—4 
5 (229) 3.70 4—5* 
Organizational 1 (159) 4.89 11.74*** 
commitment 2 (374) 4.62 1-2* 
3 (347) 4.31 2—3*** 
4( 34) 4.63 3-4 
5 (266) 4.26 4-5 
Union commitment 1 (145) 4.49 8.60*** 
2 (335) 4.96 e E 
3 (317) 4.60 2-3*** 
4( 28) 4.19 8—4 
5 (230) 4.44 4—5 





? Labor-management relationship climate, the categorical variable, was divided into 5 cate- 
gories at the 20th, 40th, 60th, and 80th percentiles. The most cooperative climate is category 1 
and the least cooperative is category 5. 

b Union commitment and dual commitment Ns include only union members; organizational 
commitment N includes all rank-and-file employees. 

*p«.05 
**p<.01 
***p-.001 


Table 3 also contains equivalent one-way analyses of variance of organi- 
zational commitment and union commitment across the same five categories 
of labor-management relationship patterns. Although commitment to employ- 
ing organizations followed a pattern similar to that of dual commitment, 
there was a slight departure from monotonicity in one cell. In the case of 
union commitment, a sharp departure from monotonicity occurred, with the 
extreme groups showing nearly identical levels of commitment to unions. 
Whereas dual commitment had shared 10.2 percent common variance with 
the labor-management relationship pattern, common variance was only 3.6 
percent for organizational commitment and less than 1 percent for union 
commitment. We performed no moderated regression analyses for these two 
independent variables because the analyses were exploratory; interpretation 
of any significant interaction effect is legitimate only where there has been 
an a priori theory suggesting such an effect (Arnold, 1982). 
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DISCUSSION 


Using independent measures of labor-management relationship climate 
and dual commitment, this study confirmed that the extent of dual commit- 
ment expressed by rank-and-file members is related to the ambient relation- 
ship between labor and management within their organizations. The likeli- 
hood of simultaneous commitment to two interacting systems such as a 
union and an employing organization appears to grow where the relation- 
ship between the systems is cooperative. Although this finding is both intu- 
itively reasonable and congruent with some of the earlier research cited in 
this paper, we are unaware of any other research on this topic in which data 
have so thoroughly supported theory. 

The finding of a significant interaction between the quality of a union- 
management relationship and the extent of members’ union participation is 
consistent with our fundamental argument. We stated that simultaneous 
commitment to two social systems is problematic to the extent that those 
systems are not cooperative and may force members to make either-or choices. 
To review the basic premise, we posited that an ambient relationship climate 
would establish the extent to which members would face choice dilemmas 
in forming loyalty attachments to the two systems. Yet it has long been 
known that people are imperfectly rational and can often compartmentalize 
their logic so as to hold simultaneously incompatible opinions (Cyert & 
March, 1963; Simon, 1976). That is, they may be able to do so until they must 
confront inherent incompatibilities. The effect of member participation, then, 
is to make any existing incompatibility salient and thus to amplify the funda- 
mental effect of climate on dual commitment. In effect, where two systems 
tend toward the low-cooperation end of the labor-management climate 
continuum, the inherent choice dilemma is most salient to union activists. 

In combination, these two variables—labor-management relationship cli- 
mate and union participation—seem to go a long way toward clarifying the 
mixed findings of earlier research on dual allegiance that failed to resolve 
the question of whether workers can be loyal to both their employing organi- 
zations and their unions. Perhaps earlier researchers were asking the wrong 
question. A more appropriate question might be, "What are the factors that 
enhance or decrease the likelihood that workers will develop dual commit- 
ment?" The research reported here has provided at least a partial response to 
that question by implicating labor-management relationship climate and mem- 
ber participation in union affairs as factors that influence the extent of dual 
commitments. 

Some methodological observations are in order. We admit that measur- 
ingan organizational attribute by aggregating self-reported data from members, 
as we did to create the variable for labor-management relationship climate, is 
controversial. Indeed, the study of organizational climate has endured much 
debate as to whether there is any such thing as an organizational climate 
(James & Jones, 1974; Schneider, 1975). The criticism is essentially that 
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researchers using survey methods have simply combined disparate percep- 
tions into fallacious averages. 

In this instance, however, we are confident that our measure of the 
labor-management relationship climate represents a meaningful attribute of 
each organization, not a spurious consensus. The scale's reliabilities were 
themselves high. Moreover, the high correlation between the perceptions of 
managers and their labor counterparts indicated that labor and management 
appeared to see ambient labor-management relationship climates in much 
the same way. One-way analyses of variance showed that within-organization 
variance on perceptions of climate was relatively low when compared to 
between-organization variance. Moreover, the uniformly strong relationships 
between these managers and labor leaders’ perceptions of climate and the 
commitment levels of other members of the same organizations add further 
evidence that labor-management relationship climates, as measured in this 
study, have real meaning. In sum, we believe that the data justify our treating 
labor-management relationship climate as an organizational attribute. 

Regarding causality, cross-sectional research of this type obviously can- 
not establish whether ambient labor-management relationship climates actu- 
ally result in varying levels of dual commitment or whether other factors 
codetermine climates and commitments. We believe a logical case can be 
made that role dilemmas imposed by membership in conflicting systems 
must necessitate a loyalty choice between adversaries. However, it remains 
to more elaborate, perhaps longitudinal, designs to have the final say. 

Assuming the correctness of the causal relationship identified in the 
preceding discussion, we see these findings as having important implica- 
tions for both labor and management, especially in light of current contex- 
tual developments. Deregulation of heavily unionized sectors—airlines, 
trucking—and increasing internationalization of competitive markets are two 
examples of such developments. Adapting to these changes will increas- 
ingly require both parties to make strategic choices about appropriate mixes 
of conflict and cooperation in their relations. This research suggests that 
both substantive strategies concerning extent of conflict or cooperation and 
unions' processes for implementing them will bear on members' subsequent 
attachments to unions and to employing organizations. These attachments 
are, in turn, of consequence because of their relationships to organizational 
performance. For example, previous research has shown that decertification 
is more likely for unions where dual commitment is low (Angle & Perry, 
1984). Leadership efforts to improve a climate, rather than to hold firmly to 
an adversarial position, would appear to be the only rational strategic choice, 
particularly when the survival of one or both parties is problematic and 
members' loyalties are essential. Although this implication from the research 
is somewhat speculative, it is ripe for additional study. 

Dual commitment to unions and employing organizations has been stud- 
ied only in the context of ongoing labor-management relationships. It may be 
equally or more important in relation to union efforts to organize new 
employers. The labor movement's current organizing difficulties give some 
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credibility to this view. Employees’ perceptions that labor is too adversarial 
are among the causes to which these difficulties have been traced. Employees’ 
tolerances for role dilemmas imposed by membership in conflicting systems 
and their expectations about the likelihood of encountering such dilemmas 
influence their votes in union representation elections. A no-union vote may 
reflect employees’ preferring to avoid the problem of divided loyalties. Again, 
if this conjecture is correct, it implies that it will be necessary for unions to 
change their adversarial images and to develop cooperation-oriented strate- 
gies. 

Bakke (1946) delivered the message that unions and management share a 
superordinate goal of mutual survival. As Whyte (1949) pointed out, labor 
unions and their management counterparts comprise “two social systems 
made up of highly interdependent parts... one is mutually dependent upon 
the other” (1949:13). In a similar vein, Drucker (1949) predicted that man- 
agement and unions might someday destroy each other unless they could 
establish conditions under which their shared members could be loyal to 
one without having to abandon loyalty to the other. We believe that the data 
reported here substantiate the views of these eminent scholars. 
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APPENDIX 
Scales and Included Items 


Labor-management relationship climate, managers’ form? 


bh bh kä 
hä 


13. 
14. 
15. 
16. 
17. 
18. 
19. 
20. 


21. 
22. 
23. 
24. 
25. 


RPO MONA TR wh 


In general: 


. The union is reasonable in dealing with management. 
. The union weakens employee discipline.» 


The union will give in to management when management is right. 
The union deals openly with management. 

The union abuses its power. 

The union tries to cooperate with management. 


. The union doesn't understand management’s problems." 
. Management's relations with the union are satisfactory. 
. The union and management are natural enemies. 
. In this organization, relations with the union are better than they used to be. 

. I expect a strike at this organization within the coming year.> 

. There are many rules which tend to dictate the settlement reached on specific issues‘ . 


b 


In negotiations: 


The union uses pressure tactics. 

The union and management are hostile toward each other.h 

The union makes concessions to avoid problems. 

The union won't listen to new ideas.> 

The union and management share most information. 

The union won't give in on anything unless they are forced.> 

Both sides are willing to try new solutions. 

The union and management work together to try to find creative solutions to problems. 


In dealing with grievances: 


The union tries to understand management's side. 

The union uses pressure tactics.> 

The union is more interested in supporting its members than in “what is right.” 
Management shares relevant information with the union.* 

The union shares all relevant information with management. 


Labor-management relationship climate, union form 


The union form was an adaptation of the management form. We reworded 24 of the items in 


a straightforward manner so that union leaders could indicate their perceptions of management. 
Example: Management is reasonable in dealing with the union. For item 2, however, simple re- 
wording was not feasible. The item was rewritten as: Management tries to interfere in internal 
union affairs.» 


Union commitment 


1. 
2. 
3. 
4. 


I am loyal to the union. 

I talk up this union to my friends as a great union to belong to. 
I find it hard to agree with my union's policies.» 

I don't care if the union survives.» 


Dual commitment 


1. 


oo hw hä 


It is easy to be loyal to both the union and management. 


. The management makes it easy to conduct union business. 

. The management makes it difficult for me to talk to my group representative or job steward.> 
. You can't be a union member and support management at the same time.» 

. The union helps me deal effectively with management. 

. Union members don't like it if you try to help management improve work effectiveness.° 
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Union participation 
Are you currently serving, or during the last jive years, have you served in any of the following 
union offices? 


ODN AA wn 


. Local president, secretary, treasurer, etc. 
. Local steward or group representative. 

. Local member of a committee. 

. District elected official. 

. District appointed committee. 

. Did you vote in the last union election? 


Have you read the union contract? 


. Do you read the union newsletter? 
. Have you filed a grievance in the past two years? 


a Adapted from the University of Michigan Organizational Assessment Package. 
P Reverse-scored item. 
€ ftem deleted from scale to maximize reliability. 
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This laboratory study compared the effectiveness of the dialectical 
inquiry, devil's advocacy, and consensus approaches to strategic deci- 
sion making by groups. Results showed that both dialectical inquiry 
and devil's advocacy led to higher quality recommendations and assump- 
tions than consensus. Dialectical inquiry was also more effective than 
devil's advocacy with respect to the quality of assumptions brought to 
the surface. However, subjects in the consensus groups expressed more 
satisfaction and desire to continue to work with their groups and greater 
acceptance of their groups' decisions than did subjects in either of the 
two other types of group studied. 


Top managers are often required to anticipate, define, and solve “wicked” 
strategic problems. Several elements characterize these complex problems: 
(1) numerous complicated linkages among organizational and environmen- 
tal elements; (2) dynamic and uncertain environments; (3) ambiguity of avail- 
able information; (4) lack of complete information; and (5) conflicts concern- 
ing the outcomes of decisions among interested parties (Mason & Mitroff, 
1981). These characteristics produce decision environments in which single 
right recommendations are rarely apparent. Such complexity often forces 
top managers to draw inferences and assumptions about their organizations 
and environments from available information and then to try to define and 
solve problems. 

Usually groups of managers with a variety of information and different 
perspectives address strategic problems (Brodwin & Bourgeois, 1984). Dis- 
cussion and other interaction among top executives are frequent means of 
sharing and evaluating information and ensuing inferences, assumptions, 
and recommendations (Glueck, 1980; Mintzberg, Raisinghani, & Theoret, 
1976; Springer & Hofer, 1978; Stagner, 1969). The group process may also be 
important in gaining commitment to and acceptance of decisions among 
individuals who ultimately will be responsible for implementing such deci- 
sions (Mason & Mitroff, 1981). 
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Group processes themselves do not ensure that top managers will ade- 
quately explore available information. Groups frequently avoid uncertainty 
and prematurely smooth over conflict (Brodwin & Bourgeois, 1984). Janis 
(1972) clearly described the problems inherent in consensus-seeking groups 
that value group harmony more highly than critical evaluation. The nature of 
interactions and the processes by which information is shared and evaluated 
appear to be critical factors in effective strategic decisions by top manage- 
ment groups. Thorough evaluations and critiques of inferences and assump- 
tions are essential (Janis, 1972). The inferences and assumptions that survive 
such an evaluation and critique are more likely to be valid than those that do 
not survive (Mitroff, 1982b; Mitroff & Mason, 1981). Without such a rigorous 
process it is probable that decision makers will retain intuitively obvious but 
invalid assumptions that will reduce the quality of groups’ solutions and 
final recommendations. 

During the past few years, there has been much debate and comparative 
research on the best approach for structuring strategic decision making groups. 
Researchers have paid the most attention to two approaches that rely on the 
use of constructive group conflict, dialectical inquiry and devil’s advocacy 
(Cosier, 1978, 1980; Cosier & Aplin, 1980; Cosier, Ruble, & Aplin,1978; 
Lourenco & Glidewell, 1974; Mitroff, Barabba, & Kilmann, 1977; Schwenk, 
1982b, 1984; Schwenk & Cosier, 1980). Dialectical inquiry uses debates 
between diametric sets of recommendations and assumptions, whereas 
devil’s advocacy relies on critiques of single sets of recommendations and 
assumptions. In spite of the amount of research on these approaches, debate 
continues concerning the validity of the findings from these studies and the 
conclusions that can be drawn from them (Cosier, 1981, 1982, 1983; Mitroff, 
1982a, 1982b; Mitroff & Mason, 1981; Schwenk, 1982a, 1982c, 1983). 

On the basis of two field studies (Lourenco and Glidewell, 1974; Mitroff 
et al., 1977), Mitroff and his colleagues contended that dialectical inquiry is 
the best approach available for improving strategic decisions. In turn, Cosier 
and his associates argued that these two studies have not been able to estab- 
lish a causal relationship between this approach and decision performance. 
On the basis of a series of laboratory experiments, they concluded that devil’s 
advocacy may be more effective than dialectical inquiry. Although these 
laboratory studies have permitted causal interpretation, a recent study by 
Schweiger and Finger (1984) suggested that inadequate experimental tasks, 
conceptualizations, and operational definitions of the two approaches may 
have prevented these studies from capturing their essences. These studies 
relied on such tasks as the Multi-Cue Probability Learning Paradigm, a cre- 
ative problem-solving task, and case studies. All of these studies required 
otherwise naive individuals to make decisions or judgments after either 
written or videotaped exposure to contrived recommendations and support- 
ing assumptions made by perceived experts. In the case of dialectical inquiry, 
studies presented plans and counterplans; for devil’s advocacy, plans and 
critiques. These research efforts studied neither approach as a group process 
nor were there any opportunities for subjects to participate in development 
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of recommendations, assumptions, or critiques. The conceptualizations and 
operational definitions used in previous laboratory studies were at variance 
with those originally proposed by Mitroff, Mason, and their colleagues 
(Mason, 1969; Mason & Mitroff, 1981; Mitroff, 1982a, 1982b; Mitroff & 
Emshoff, 1979). 

An examination of how the two approaches were intended to function 
will illustrate weaknesses in previous laboratory research. Both dialectical 
inquiry and devil’s advocacy recommend the use of constructive conflict in 
group decision making. Formalized argumentation and debate among top 
managers allow assumptions and recommendations to be systematically eval- 
uated and the strengths and weaknesses of each to be made explicit (Mason 
& Mitroff, 1981). Therefore, an adequate assessment of the effectiveness of 
the two approaches requires a research design that incorporates active inter- 
action among participants in decision making groups. As Mitroff succinctly 
noted: 


the dialectical inquiry method [and devil’s advocate method are] 
. above all ... learning processes whereby through active, 

heated, and intense debate between and among interested par- 

ties (not naive subjects) the parties came to discover and to invent 

entirely new alternatives as well as elaborate on old ones (1982b: 

222). 
Although both approaches employ conflict as their primary structural 
mechanism, there are inherent differences between them that also suggest 
differences in their effectiveness. According to its proponents (Mitroff, 1982b; 
Mitroff & Mason, 1981), dialectical inquiry derives from Hegel’s dialectic. 
They argue that high quality decisions will result from pitting diametrically 
opposed assumptions and recommendations—a thesis and antithesis—against 
each other. Debate of the two positions that maximizes conflict and critical 
evaluation will allow a synthesis to evolve that retains assumptions and 
recommendations that have survived the test. Dialectical inquiry uses two 
sets of recommendations; devil’s advocacy relies on one set and a critique of 
that set. Mason (1969) suggested that dialectical inquiry should lead to higher 
quality solutions than devil’s advocacy because the latter focuses only on 
what is wrong with assumptions and recommendations rather than on find- 
ing or identifying suitable alternatives. Essentially, devil’s advocacy is less 
effective than dialectical inquiry in developing alternative sets of assump- 
tions and recommendations. 

Scholars have also argued that the conflict orientation of both approaches 
should produce higher quality decisions than group approaches that do not 
formally structure argumentation and debate (Mason & Mitroff, 1981; Mitroff, 
Emshoff, & Kilmann, 1979). Schools and organizations commonly teach one 
such approach, consensus, and top management groups often use it in deci- 
sion making (Bourgeois, 1980; Business Week, 1978; Mintzberg et al., 1976; 
Quinn, 1980). The consensus approach has been found to lead to higher 
quality decisions than conventional group interaction does (Nemiroff, 
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Pasmore, & Ford, 1976).1 Proponents suggest that all members of consensus 
decision making groups are encouraged to state their assumptions and recom- 
mendations and then freely discuss them until they reach final decisions, 
whereas in conventional groups this is not always the case (Hall, 1971). 
Although the consensus approach advocates free expression in groups, 
it provides no formal procedure for testing and evaluating these.expressions. 
Given the frequent use of consensus in top management groups, it was quite 
important that we include this approach in the study. Moreover, its inclu- 
sion provided an excellent opportunity to compare the effects of formal 
structured intragroup conflict techniques with a less structured technique. 
Although the quality of assumptions and recommendations is important 

to the effectiveness of decisions by top management groups, group members’ 
satisfaction and desire to continue to work together on subsequent tasks are 
also important, especially in ongoing groups (Nadler, Hackman, & Lawler, 
1979). In structured conflict the inherent argumentation and debate 

can lead to damaged feelings when the members resolutely live 

up to their role as critical evaluators and take each other’s pro- 

posals over the bumps. Feelings of rejection, depression, and 

anger might be evoked so often when this role assignment is put 

into practice that it could have a corrosive effect on morale and 

working relations within the group (Janis, 1972: 210). 
Results of a survey of American Management Association executives 
(Schmidt, 1974) demonstrated these concerns. The survey found that although 
conflict has positive effects, as discussed earlier in this paper, negative out- 
comes included individuals feeling defeated and demeaned, increased dis- 
tance between people, suspicion and distrust, and a breaking down of 
teamwork. In a discussion of decision making in boardrooms, Welty (1983) 
suggested that consensus creates a less combative environment than debate. 
Although both dialectical inquiry and devil’s advocacy are conflict-based 
approaches, the first should lead to greater conflict than the second. To the 
extent that participants have emotional stakes in the assumptions and recom- 
mendations they have developed, dialectical inquiry could create greater 
conflict since it requires the development of diametric positions. Devil’s 
advocacy requires only one set of assumptions and recommendations; those 
who prepare critiques may develop smaller emotional stakes in their views 
than do those who develop alternate recommendations. Thus, we offer the 
following hypotheses concerning the effects of dialectical inquiry, devil’s 
advocacy, and consensus on group performance and individual members’ 
perceptions of groups: 


1 It is important to note that the conventional groups used as controls in Nemeroff and 
colleagues’ (1976) study were not instructed as to the process they should use and were left to 
their own devices in arriving at group decisions. These groups were found to have used majority 
vote, averaged rankings, trading, and political power among individuals to make decisions. 
Thus, these groups were not truly control groups. Since groups develop some process, whether 
instructed to or not, it is much more useful to compare groups trained in particular group 
approaches than to let them vary. 
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Hypothesis 1: There will be differences in performance 
between groups using different approaches to group deci- 
sion making. 


a: Dialectical inquiry groups will perform better than 
both devil’s advocacy and consensus groups. 


b: Devil’s advocacy groups will perform better than 
consensus groups. 


Hypothesis 2: Among groups using different approaches 
to group decision making, there will be differences in 
members’ satisfaction with groups and in their desires to 
continue to work with the same groups on subsequent 
tasks. 


a: Members of consensus groups will have greater sot. 
isfaction and desire to continue working with the groups 
on subsequent tasks than will members of dialectical in- 
quiry or devil’s advocacy groups. 


b: Members of devil’s advocacy groups will have 
greater satisfaction and desire to continue working with 
the groups on subsequent tasks than will members of dia- 
lectical inquiry groups. 


Since all three group decision making approaches are designed to en- 
courage group members to evaluate and reevaluate critically their assump- 
tions and recommendations, we also attempted to determine which of these 
methods was most successful in accomplishing this. Therefore, we also tested 
the following hypotheses: 


Hypothesis 3: There will be differences in levels of criti- 
cal evaluation of assumptions and recommendations 
among members of groups using different approaches to 
group decision making. 


a: Dialectical inquiry groups will produce more crit- 
ical evaluation of assumptions and recommendations 
among group members than either devil’s advocacy or 
consensus groups. 


b: Devil's advocacy groups will produce more critical 
evaluation of assumptions and recommendations among 
group members than consensus groups. 


METHODS 
Subjects 


Participants were 120 M.B.A. students at the University of Houston. 
Their mean age was 29 years (s.d. = 5.2); their mean number of years of 
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full-time work experience was 6.4 (s.d. = 4.9); and their mean number of 
years of managerial experience was 2.3 (s.d. = 3.9). With respect to highest 
level of managerial experience, 45.8 percent had never been managers, 34.2 
percent had been lower-level managers, 15.8 percent had been middle-level 
managers, and 3.3 percent had been top-level managers. Seventy-two were 
men and 47 were women. Demographic data were missing for one individual. 
Subjects were randomly assigned to four-person groups and each group 
was randomly assigned to either a dialectical inquiry, devil’s advocacy, or 
consensus condition. Thus each experimental condition had ten groups. 


Procedures 


The study ran simultaneously in three sections of a semester-long 
M.B.A. course in corporate strategy and policy. We chose this course be- 
cause the decisions required in it were similar to those for which the three 
approaches to decision making were designed. The procedures used in the 
three sections were identical. Participation was a graded course requirement 
and the study took place during class time. We began collecting data at mid- 
semester to allow time for the students to have developed the background 
necessary for making strategic decisions. Students had been exposed to seven 
weeks of lectures, readings, and cases on strategic management prior to start 
of the study. 

At the end of the class period preceding the actual administration of the 
experimental conditions, subjects received a copy of the case that would be 
used in the study. They were instructed to read the case, analyze it, prepare 
written recommendations and lists of their assumptions, and to be prepared 
to discuss the case in depth at the next class meeting. To ensure that sub- 
jects would be prepared, they were required to turn in copies of their assump- 
tions and recommendations for grading. All complied. It was essential that 
this initial preparation of the case be conducted at home since it would have 
otherwise consumed an inordinate amount of class time. It was also reason- 
able to expect that managers would prepare themselves before group 
meetings. Subjects were part-time night students; lectures had not previously 
discussed any of the experimental conditions, and subjects got the case only 
two days before the experimental session. Thus, it seemed unlikely that 
preparation at home would confound the research. 

Upon the arrival at the experimental session—the next class—subjects 
learned that they would now be working on the same case in four-person 
groups. After random assignment to groups they went with escorts to sepa- 
rate rooms for each approach. They then received instructions describing 
the group approach that they were to use to generate assumptions and recom- 
mendations based on the case. Experimenters also read the instructions 
aloud and answered questions about them. Finally, they isolated groups in 
their separate locations to begin their work. 

To avoid possible experimenter and class section confounds, we took 
three precautions in designing and administering this experiment. First, 
groups were equally assigned to experimental conditions across classes. 
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Second, the three experimenters who gave and read the group instructions 
were randomly assigned to the different conditions across the three classes. 
Finally, the experimenters were instructed to answer only questions that 
requested clarification of the manipulation instructions. 

Groups had three hours, amounting to two class sessions, in which to 
generate and record their final recommendations and assumptions. Sub- 
jects were instructed not to discuss the assignment with anyone outside their 
group or subgroup? between class sessions. Each room contained a tape 
recorder that was used for manipulation checks. All groups were notified 
when 15 minutes remained in their work time. When time ran out, experi- 
menters returned, collected tape recorders and all materials, and adminis- 
tered a 12-item questionnaire that included manipulation checks and ques- 
tions eliciting group members' perceptions. A demographic information form 
was also administered. Subjects were fully debriefed in a subsequent class 
meeting. 


Task 


The study used the Leitch Quality Drug Company case (Glueck, 1980). 
We chose this relatively brief case because it presents subjects with a num- 
ber of real strategic problems and allows analysis in a reasonable amount of 
time. The case presents a situation that confronted an actual but disguised 
company and the four partners who owned it. The company, which operated 
three drugstores in Orlando, Florida, faced a number of significant changes 
in its environment as well as a number of internal problems. Some of the 
changes included demographic shifts and the growth of large super-discount 
drugstores. The three stores operated in very different neighborhood markets. 
Internal problems included weak accounting systems, poor inventory control, 
and ineffective pricing policies. One of the partners, Mr. Neds, had urged the 
other three to make changes, but to no avail. 

Individuals and groups were instructed to analyze the company's situa- 
tion and address recommendations to Mr. Neds. They were also instructed to 
state the facts and assumptions that supported their recommendations. 


Experimental Manipulations 


Dialectical inquiry manipulation. Two studies (Mason, 1969; Mason & 
Mitroff, 1981) provided the basis for this manipulation. Ten groups received 
the following statement of information and instruction: 

In the dialectical inquiry approach two different recommendations, based on con- 
trary assumptions, are developed from the same data. The two recommendations and 
their respective assumptions are subjected to an in-depth, critical evaluation through 
a debate between two advocate subgroups. Using the same data, the debaters attempt 
to spell out the implications of each decision, reveal its underlying assumptions, and 
challenge (or defend) those assumptions as effectively as possible. In other words, 


? Two subgroups were formed with each dialectical inquiry and devil's advocacy group. 
Each subgroup's responsibilities are defined in the Manipulation section of the paper. 
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each side is trying to win the debate. Following the debate the two advocate sub- 
groups should settle on which assumptions survived the scrutiny of debate and 
attempt to develop a recommendation based on them. Proponents of the dialectical 
inquiry decision making approach believe that sound judgments or recommendations 
are more likely to result from thorough identification and criticism of proposed de- 
cisions and their underlying assumptions. Typical guidelines and procedures used in 
the dialectical inquiry approach follow: 

1. Divide your four-person group into two (2) two-person advocate subgroups. 

2a. One subgroup should develop recommendations and build an argument for them, 
supported by all key assumptions, facts, and data that underlie them. Write the 
recommendations, assumptions, facts, and data as completely and lucidly as you 
can on the Subgroup I Recommendations form provided for this purpose. Return 
the form to the other subgroup. 

2b. The other subgroup should await receipt of the list of key assumptions made by 
the first group. While waiting, you may discuss the case only between yourselves. 
Upon receiving this list of key assumptions, the second subgroup should develop 
plausible assumptions which negate the first subgroup’s key assumptions. Using 
these new assumptions, the second subgroup should develop counter-recom- 
mendations and complete the Subgroup 2 Recommendations form. 

3. Following step 2b, both advocate subgroups should present both orally and in 
writing their assumptions, recommendations, and supporting facts and data to the 
other subgroup. 

4. The two advocate subgroups debate their recommendations and the validity of the 
assumptions they have made. The objective of this debate is to arrive at a final 
list of assumptions that is acceptable to both subgroups. 

5. Once the debate is completed, you should reach agreement on which assumptions 

survived. Any new assumptions that arise from the debate should also be included. 

Using the surviving assumptions, develop recommendations. 

Record the final recommendations, assumptions, facts, and data on the Final Re- 

commendations form provided. 

Devil's advocacy manipulation. This manipulation was based on Cosier 


(1981) and Mason and Mitroíf (1981). Ten groups received the following 
. statement of information and instruction: 


The devil’s advocacy approach develops a solid argument for a reasonable recom- 
mendation, then subjects that recommendation to an in-depth, formal critique. The 
critique calls into question the assumptions and recommendations presented to the 
devil’s advocate, and attempts to show why the recommendations should not be 
adopted. Through repeated criticism and revision, the approach leads to mutual ac- 
ceptance of a recommendation. Proponents of this decision making approach believe 
that good recommendations and assumptions will survive even the most forceful and 
effective criticism and that this approach is more likely to yield sound judgments or 
recommendations. Here are some guidelines and procedures to follow in using the 
devil’s advocate approach: 

1. Divide your four-person group into two (2) two-person subgroups. Assign one sub- 
group the formal role of devil’s advocate. Separate into your subgroups. 

2. Discuss the Leitch Quality Drugs case with your subgroup partner. 

3a. The subgroup that is not devil’s advocate then should develop a set of recom- 
mendations and build an argument for them, supported by all key assumptions, 
facts, and data that underlie them. Write the recommendations, assumptions, 
facts, and data as clearly and thoroughly as you can on the Subgroup I Recom- 
mendations form provided for this purpose. 

3b. Meanwhile, the devil’s advocate subgroup should prepare for their critique by dis- 
cussing the case and any critical assumptions, data, etc., which they can identify. 


"9 
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4. The first subgroup presents its written recommendations and assumptions to the 
devil’s advocate subgroup. The devil’s advocate subgroup subjects the recom- 
mendations to a formal critique. The critique attempts to uncover all that is wrong 
with the recommendations, assumptions, facts, and data and to expound the rea- 
sons why the recommendations should not be adopted. 

5. Following Step 4, the critique is presented to the first subgroup orally and on the 
Critique form provided. The first subgroup then meets separately once again and 
revises its recommendations to satisfy the valid criticisms of the devil’s advocate 
subgroups. 

6. Repeat Steps 4 and 5 until both subgroups can accept the recommendations, as- 
sumptions, and data. 

7. Write the final recommendations, assumptions, facts, and data on the Final Re- 
commendations form with which you have been provided. 


Consensus manipulation. This manipulation was based on Hall (1971) 
and Nemiroff and colleagues (1976). Ten groups received the folowing state- 
ment of information and instruction: 


The consensus approach relies on a thorough, open, and constructive discussion 
and examination of the recommendations and underlying assumptions developed in- 
dividually by the group members. In the course of this discussion each group mem- 
ber should have the opportunity to present his recommendations, the underlying as- 
sumptions, and relevant facts and data in as clear and logical a manner as possible. 
Through discussion, questioning, and more complete exchange of information and 
opinion, the group seeks a better recommendation than might be produced by a single 
person. It is not necessary that each person be completely satisfied with the assump- 
tions and recommendations—only that each can accept them on the basis of logic 
and a willingness to consider them as feasible. Consensus is said to exist when all 
group members can accept the assumptions and recommendations on this basis. 
Proponents of this decision making approach believe that better assumptions and re- 
commendations result from a more complete investigation and airing of data and 
ideas and a logical resolution of differences within the group. Here are some guide- 
lines to use in achieving consensus: 

1. Avoid arguing blindly for your own assumptions and recommendations. Present 
your position clearly, logically, and persuasively, but consider carefully the com- 
ments and reactions of other group members. If you present the same points again, 
take comments and reactions into account. 

2. Avoid making ‘‘win-lose” statements in your discussion. Discard the notion that 
someone must win and someone lose in the discussion. When impasses occur, 
look for the next most acceptable solution for all parties. 

3. Avoid changing your mind simply to avoid conflict and reach agreement. With- 
stand pressures to yield which have no logically sound foundation. Strive for en- 
lightened flexibility; avoid mere capitulation. 

4. Avoid conflict reducing techniques such as majority voting, tossing a coin, and 
the like. Differences of opinion indicate an incomplete exchange of relevant infor- 
mation on someone's part; press for additional sharing of task or emotional data 
where it seems in order. 

5. View differences of opinion as natural and helpful rather than as a hindrance to de- 
cision making. Generally, the more assumptions and recommendations expressed, 
the greater the likelihood of canflict, and the richer the resources used in solving 
the problem at hand. 

6. View all initial agreements as suspect. Explore the reasons for the apparent agree- 
ment. Make sure that people have arrived at similar recommendations either for 
the same reasons or for complementary reasons before incorporating such recom- 
mendations into the group’s final set. 
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7. At the conclusion of the group’s work, record the group’s final assumptions, 
. recommendations, facts, and data on the Final Recommendations form provided. 


Dependent Measures 


Group performance. Judges' ratings of the groups' final recommenda- 
tions and supporting assumptions were used to assess performance. The 
judges were educators and researchers in business policy and strategy. One 
of the judges, in fact, wrote the published teaching note for the case used in 
the study. Two judges independently rated assumptions on three dimensions: 
(1) the number of assumptions groups brought to the surface, (2) the valid- 
ity of the assumptions made with respect to information provided in the 
case, and (3) the importance of the assumptions with respect to the deci- 
sions made. 

Judges computed scores for numbers of assumptions by counting the 
assumptions that each group provided. They rated the validity of each 
assumption on a 5-point scale ranging from 1, low confidence in the validity 
of the assumption, to 5, high confidence in the validity of the assumption. 
They rated the importance of each assumption on a 5-point scale ranging 
from 1, not important, to 5, very important. Each assumption was rated on 
both its validity and importance. Final group validity scores were deter- 
mined by summing the validity ratings of each group's assumptions and 
then dividing the sum by the number of assumptions the group brought to 
the surface. Judges computed group importance scores in a like manner. The 
study used validity and importance averages because otherwise scores of 
groups that surfaced many low-rated assumptions could be inflated beyond 
those of groups that surfaced fewer, but more key assumptions. 

The two judges also independently rated the overall quality of each 
group's recommendations on a 5-point scale. The scale ranged from 1, low 
quality recommendation, to 5, high quality recommendation. 

Group members' perceptions. A 12-item questionnaire was administered 
to each group member immediately after a group's work was completed. The 
questionnaire asked the extent to which respondents agreed with statements 
describing perceptions of and reactions to the decision making experience. 
Included in the questionnaire were six items used as manipulation checks, 
two items for each manipulation; two items assessing satisfaction and desire 
to continue to work with the groups; two items assessing the extent to which 
group condition led members to critically reevaluate their personal assump- 
tions and recommendations; and two items assessing acceptance of groups' 
assumptions and recommendations. The Appendix presents the question- 
naire items, response formats, and internal consistency reliabilities. 

Tape recordings. Each group's work session was tape recorded for use as 
a manipulation check. Using as their criteria the three written manipulations 
provided to the groups, two judges listened to the tapes and independently 
classified each group into one of the three decision making conditions. 
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RESULTS 
Manipulation Checks 


Responses to the six questionnaire items and the tape recordings were 
used to determine the degree to which experimental manipulations suc- 
ceeded. The questionnaire items allowed assessment of individuals and the 
tapes allowed assessment of groups. 

Since manipulation checks with two items were used for each condition, 
we computed internal consistency estimates using Cronbach’s alpha. The 
scales demonstrated reasonable internal consistency (see the Appendix). 
Therefore, we retained all three checks for further analysis. As expected, 
agreement with the dialectical inquiry manipulation check differed signifi- 
cantly by condition (F» 417 — 66.7, p<.001). Further, individuals in the dialecti- 
cal inquiry condition differed significantly from those in the other two condi- 
tions (p<.01, planned t-tests). Agreement with the devil’s advocacy manipula- 
tion check also differed significantly by condition (F2 117 —86.1, p<.001). 
Planned t-tests also indicated that individuals in the devil's advocacy condi- 
tion differed significantly (p«.001) from those in the other two conditions. 
Finally, agreement with the consensus manipulation check also differed 
significantly by condition (F2 447 — 66.4, p.001). Planned t-tests indicated 
that individuals in the consensus condition differed significantly from those 
in the other two conditions (p<.001). To further assess the extent to which 
the manipulations were successful, we conducted a discriminant analysis on 
the manipulation checks for individuals. The analysis classified 90 percent 
of all the individual checks correctly as to group condition. 

The judges' classifications of the tapes gave less support for the success of 
the manipulations. The two judges agreed on only 70 percent of the classi- 
fications, 21 of 30 classifications made. Of the 9 classifications on which 
judges did not agree, 7 involved dialectical inquiry or devil's advocacy 
groups. This suggests that the distinctions between these two approaches are 
not always apparent in practice. Use of two subgroups in both conditions 
might have caused overlap. Follow-up discussions between the two judges 
brought resolution of the disagreements. Although they agreed on final 
classifications for all 30 tapes, the judges still had questions concerning 
several of them. 

To assess the success of the manipulations, judges' classifications were 
compared to the actual manipulations. There was a match on 23 of 30 class- 
ifications, or 76 percent. Of the seven remaining classifications, three dialect- 
ical inquiry manipulations were classified as devil's advocacy, two devil's 
advocacy manipulations were classified as dialectical inquiry, one dialectical 
inquiry was classified as consensus, and one consensus was classified as dia- 
lectical inquiry. Overall, the individual level manipulation checks suggest 
that the manipulations were successful. However, the taped manipulation 
checks provided only moderate support for the success of the manipulations. 
The discrepancy between the individual and taped manipulation checks 
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may be partially due to the priming effects that the manipulation instruc- 
tions may have had on individuals’ questionnaire responses. 


Group Performance® 


Table 1 presents means and standard deviations for the measures of 
group performance. Table 2 summarizes the results for each measure, which 
are also discussed in the next section. 

Number of assumptions. Before applying the number of assumptions 
measure, we assessed interrater reliability using correlations and percentage of 
agreement between judges. The correlation between the judges was .74; they 
agreed on 76 percent or on 23 of 30 groups. In only one case was there large 
disagreement between the judges; this disagreement focused on whether cer- 
tain statements were in fact assumptions. Since six of the differences were 
small, we resolved them through averaging; the seventh was resolved through 
discussion. The number of assumptions groups brought to the surface did not 
differ significantly across the experimental conditions. 

Quality of assumptions. Two measures were used to assess the quality of 
the assumptions groups surfaced: validity and importance. For both measures 
of quality, we computed interrater correlations and percentage of agreement 


TABLE 1 
Means and Standard Deviations for Group Performance 
and Group Members’ Perceptions 





Type of Group Approach 


Dialectical Inquiry — Devil's Advocacy Consensus 
Dependent Variables Means s.d. Means s.d. Means s.d. 
Group performance 
Number of assumptions 12.50 3.57 11.65 5.96 13.40 5.47 
Validity of assumptions 4.27 0.39 3.81 0.52 3.16 0.30 
Importance of assumptions 4.02 0.29 3.72 0.50 3.06 0.23 
Group members' perceptions 
Satisfaction with and 
desire to continue to 
work with the group 7.23 1.46 7.13 1.24 8.23 1.41 
Critical evaluation _ 8.50 1.11 8.38 1.23 7.03 1.37 
Acceptance of group 
decision 8.50 1.20 7.30 1.36 8.15 1.00 
Confidence in group 
decision 7.23 1.33 7.60 1.32 7.68 0.89 


3 Because of the questionable results of the taped manipulation checks, we conducted 
analyses parallel to the ones presented in this section without the seven groups for which the 
judges' classifications did not agree with the actual manipulations. All analyses were identical 
for the full sample of 30 groups and the sample excluding the seven groups, except for the an- 
alysis involving critical evaluation. It was not significant in the smaller sample. 
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between judges. We eliminated redundant assumptions among the groups 
and assessed reliabilities on all unique assumptions. A total of 297 unique 
assumptions were judged. For the validity measure the correlation was .88. 
The actual level of agreement was 66 percent, with 22 percent of the assump- 
tion ratings deviating by one point on the 5-point scale and 9 percent deviating 
by two points. The correlation for the importance measure was .78. The actual 
level of agreement was 59 percent with 28 percent of the assumption ratings 
deviating by one point on the 5-point scale and 5 percent deviating by two 
points. Given the extreme difficulty in accurately rating so many assump- 
tions, we concluded that a reasonable level of interrater reliability had been 
established to retain both measures. Because differences between judges were 
not very large, we averaged the judges’ ratings to arrive at the final rating. 

The correlation between the validity and importance measures was .77 
(p<.001). This correlation and conceptual evidence suggesting that these 
measures are related required that we analyze validity and importance to- 
gether; the combination was termed quality of assumptions. A one-way 
MANOVA indicated that there was a significant difference in the quality of 
assumptions that surfaced among the conditions (F4 55 = 9.06, Wilks's lambda = 
.347, p<.001). To further understand the univariate effects, we computed 
F-tests for each dependent variable. The effects for both validity (F227= 
17.88, p<.001, «? = .53) and importance (F5 5; — 18.4, p<.001, o? —.54) were 
significant. As indicated in Table 2, all but one of the planned t-tests among 
the groups were also significant for both validity and importance of assump- 
tions. The exception was the comparison between dialectical inquiry and 
devil's advocacy for importance of assumptions. 

Quality of recommendations. Using the ratings for all 30 groups, we 
computed both correlation and percentage of agreement again for the two 
judges' ratings. In this case the correlation between the judges was .68. The 
level of agreement was 60 percent with 10 percent of the quality ratings 
deviating by one point on the 5-point scale and 10 percent deviating by two 
points. Twenty percent of the ratings deviated by three or more points. The 
moderate level of disagreement suggested that discussion rather than averag- 
ing would be the best way to resolve differences between the judges. All 
discrepancies were resolved. Given the moderate correlation and percentage 
of agreement between the judges, some caution is required in interpreting 
the results.using this measure. 

The results on decision quality indicate that there were significant differ- 
ences among groups (F; 5, = 3.73, p<.05, o? =.08). Further analysis using 
planned t-tests (see Table 2) found that both the dialectical inquiry and 
devil's advocacy groups made significantly higher quality recommendations 
than the consensus groups, but that there was no difference in quality between 
the first two types. 

Of additional interest was the relationship between the quality of recom- 
mendations and the quality and number of assumptions brought to the surface. 
The correlation between quality of recommendations and number of assump- 
tions was not significant (r=.10). The correlation between the quality of 
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TABLE 2 
: Results of Performance Comparisons among Group Conditions? 
Dependent Measures Results 
Quality of assumptions 
MANOVA 
Validity and importance ben = 9.06*** 
ANOVA 
Validity Fy 975 = 17.88*** 
Importance Raas = 18.43*** 
Number of assumptions Fans 1.33 
t-tests 
Validity 
Dialectical inquiry vs. devil's advocacy tig = 2.23* 
Dialectical inquiry vs. consensus Dn = 7.12*** 
Devil’s advocacy vs. consensus tig = 3.37** 
Importance 
Dialectical inquiry vs. devil's advocacy lg = 1.53 
Dialectical inquiry vs. consensus tg = 7.1B*** 
Devil’s advocacy vs. consensus tg = 377** 
Quality of recommendations 
ANOVA 
Quality of recommendation Faas = 3.73* 
t-tests 
Quality of recommendations 
Dialectical inquiry vs. devil’s advocacy tig = 123 
Dialectical inquiry vs. consensus tig = 2.34* 
Devil’s advocacy vs. consensus tg = 2.49% 


DN = 30, 10 per cell. 
‘Subscripts are degrees of freedom. 
*p«.05 
**p«.01 
***p«.001 


recommendations and the quality of assumptions—the combined validity 
and importance scores—was barely significant (r = .39, p<.10). Although not 
very strong, the latter correlation supports numerous decision theorists who 
have suggested that bringing assumptions to the surface and analyzing them 
are crucial for high quality recommendations. We can make no causal infer- 
ferences here about how individuals thought through the assumptions and 
recommendations, since it is possible that they first made their recommenda- 
tions and only afterwards supported them with assumptions. This possibil- 
ity was considered unlikely, but it should be noted that this study had no 
controls to prevent it. With respect to the groups themselves, the contents of 
the tape recordings indicate that all the groups initially arrived et their 
assumptions prior to developing the recommendations. However, a few groups 
modified assumptions as they evaluated recommendations; there was no 
relationship between experimental condition and these modifications. 
Group members’ perceptions. Table 1 presents means and standard de- 
viations for the measures of group members' perceptions. Table 3 presents 
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results for each individual measure, which are also discussed next in this 
section. 

Satisfaction and desire to continue to work with groups. We used a 
2-item scale developed from Tjosvold and Field (1983) to measure satisfaction 
with groups. Since the scale demonstrated reasonable internal consistency 
we retained it for further analysis. In addition to performance differences 
among the three conditions, there were also significant differences in satis- 
faction and desire to work with groups (F3 4,7 = 7.86, p<.001, w? = .10). 
Planned t-tests presented in Table 3 indicate that individuals in the consen- 
sus groups were significantly more satisfied and desirous of working with 
their groups in the future than members of either the devil’s advocacy or 
dialectical inquiry groups. No difference emerged, however, between the 
latter two types of group. These data suggest that programmed conflict tech- 
niques lead to lower satisfaction among group members. 

Critical evaluation. The 2-item measure used to assess this construct 
demonstrated low internal consistency. Therefore, caution is needed in inter- 
preting the findings using it. The results indicate a significant difference 
among the conflict methods in their ability to induce members to subject 
their own ideas to critical evaluation (F; 4,7; = 17.36, p<.001, e? = .04). 
Planned t-tests (see Table 3) indicated that both dialectical inquiry and devil's 





TABLE 3 
Group Members' Perceptions across Group Conditions? 
Dependent Measures Results 
Satisfaction and desire to continue to work with group 
ANOVA j Fz? = 7.86*** 
t-tests 
Dialectical inquiry vs. devil’s advocacy tg = -3.71*** 
Dialectical inquiry vs. consensus tg = -3.12** 
Devil’s advocacy vs. consensus tza = -0.33 
Critical evaluation of assumptions 
ANOVA Fa = 17.36** 
t-tests 
Dialectical inquiry vs. devil’s advocacy Le = 0.48 
Dialectical inquiry vs. consensus Le = 5,30*** 
Dialectical advocacy vs. consensus Lon = 4.63*** 
Acceptance of group decisions 
ANOVA Puy = 19.02*** 
Duncan Multiple Range test 
Dialectical inquiry vs. devil's advocacy Ww > .688** 
Dialectical inquiry vs. consensus Ww > .718** 
Devil’s advocacy vs. consensus w >  .688** 


aN= 120, 40 per cell. 
bSubscripts are degrees of freedom. 
*p«.05 
**p<.01 
***p«.001 
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advocacy were more capable of stimulating critical evaluation than consen- 
sus, but that there was no difference between dialectical inquiry and devil's 
advocacy. 

Acceptance of group decisions. Although no hypothesis addressed it, 
we took an additional measure commonly used in research on group deci- 
sion making (Tjosvold & Field, 1983), acceptance of a group decision. A 
2-item acceptance measure yielded reasonable internal consistency. 

There was a significant dicference among the group conditions on 
acceptance (F, 417 = 19.02, p«.001, o? = .24). Since differences were not 
hypothesized a priori, specific dizferences among the group means could not 
be assessed with planned comparisons. However, post hoc comparisons using 
Duncan's multiple range test (see Table 3) indicated that there was greater 
acceptance of decisions with the consensus condition than with either of the 
two other conditions. The devil's advocacy condition also led to greater ac- 
ceptance than the dialectical inquiry condition. 


DISCUSSION 


This study, the first controlled laboratory experiment to assess the com- 
parative effectiveness of dialectical inquiry, devil's advocacy, and consensus 
in a group context, suggests that programmed conflict is useful in improving 
the quality of strategic decisions. In particular, the results obtained using 
both the full set of groups and excluding the seven groups suggest that 
both dialectical inquiry and devil's advocacy were more effective than con- 
sensus in generating high quality recommendations and underlying assump- 
tions. Although dialectical inquiry was more effective than devil's advocacy 
with respect to the quality of th» assumptions, there were no differences 
with respect to the quality of the recommendations. These results support 
Mitroff's general proposition that dialectical inquiry is more effective than 
devil's advocacy in bringing high quality assumptions to the surface, but fails 
to confirm the proposition that dialectical inquiry is more effective than 
devil's advocacy in the development of quality recommendations. Although 
the previously cited studies by Cosier and his colleagues have been criti- 
cized on conceptual grounds (Mitroff, 1982b; Schweiger & Finger, 1984), this 
study provided partial support for their findings. That is, it did not show 
dialectical inquiry to be clearly superior to devil's advocacy. Since there 
were significant differences between the two approaches in the assumptions 
generated and not in the recommendations, further research is suggested on 
how these two approaches function. ` 

Very little is known about how groups should actually use either approach 
to arrive at decisions. All that theorists have suggested is that a debate follow 
the critique in devil's advocacy and the negation in dialectical inquiry. Since 
neither method clearly specifies how groups should use acceptable sets of 
assumptions to construct final recommendations, it is conceivable that any 
benefits accrued in bringing assumptions to the surface are diluted as recom- 
mendations are developed. More research is needed to understand this tran- 
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sition from assumptions to recommendations. Although the correlation was 
not very strong, the quality of recommendations was significantly related to 
the quality of assumptions. Again, these findings support the views of Mit- 
roff and his colleagues. The lowness of this correlation, however, suggests 
that the assumptions and some degree of error were not the only factors af- 
fecting the quality of recommendations. Understanding the processes by 
which groups develop assumptions and use them to choose recommenda- 
tions may be quite instrumental to understanding the limits of these decision 
making approaches. It is possible that the processes used to arrive at assump- 
tions and recommendations differ. The strength of the ws further suggests 
that the group approaches had more of an effect on the quality of assump- 
tions raised than on the quality of recommendations. It is also possible that 
these approaches are easier to describe than to carry out. The taped mani- 
pulation checks may indicate that group members need to be trained in 
using the approaches or need to gain experience with them before they can 
successfully implement them. 

The results also suggest that those interested in selecting group decision 
making techniques should pay attention to context. Although consensus 
may be useful for some nonstrategic tasks, the evidence provided in this 
study clearly suggests that it is inferior to both dialectical inquiry and devil's 
advocacy with respect to the overall quality of the assumptions and recom- 
mendations made. This is not surprising, given Janis's (1972) concern that 
consensus may lead to too little conflict in the group decision making pro- 
cess and thus to uncritical acceptance of invalid assumptions and inferior 
recommendations. The results of this study partially support the idea that 
both dialectical inquiry and devil's advocacy lead to significantly greater 
critical evaluation of personal assumptions on the part of group members 
than consensus. However, problems of measurement and the nonsignificant 
results obtained from the analysis excluding the seven groups preclude any 
firm conclusions. More research is certainly warranted on the application of 
these techniques to specific types of problems and decisions. 

Concerning satisfaction with groups and desire to work with them in the 
future, the findings suggest that consensus may be more functional than 
either dialectical inquiry or devil's advocacy for preserving harmony within 
a group. This was also evident in the greater degree of acceptance of deci- 
sions in the consensus groups than in the dialectical inquiry and devil's 
advocacy groups. Overall, these results pose a dilemma in selecting a deci- 
sion making method. On the one hand, conflict improves decision quality; 
on the other, it may weaken the ability of a group to work together in the 
future. Obviously, this is less of a problem for ad hoc groups than for ongoing 
ones like top management teams. It is plausible that an ongoing group's 
ability to continue using the conflict procedures might decline over time. 
This research provided no evidence that explained why members of the 
dialectical inquiry and devil's advocacy groups were less satisfied. Perhaps 
people just do not view conflict favorably. A more likely explanation, 
however, is that the groups did not manage conflict properly. An actual 


68 Academy of Management Journal March 


organizational group needs norms and rewards that encourage constructive 
conflict (Brodwin & Bourgeois, 1984; Sussman & Herden, 1982). 

The findings on satisfaction and acceptance also have implications for 
decision implementation. Group members may be more likely to implement 
decisions that were reached through low rather than high conflict approaches. 
Again, this presents a dilemma—group members may not accept high quality 
decisions. Since we did not monitor and analyze the actual group processes 
involved in the three conditions, other than through the taped manipulation 
checks, it is difficult to determine why conflict reduced satisfaction and 
acceptance. Unfortunately, the difficulty of identifying individual interac- 
tions in our tapes prevented such an analysis. Future research needs to more 
fully address process in the three approaches. 

As does any study, this one has its limitations. Although a strategic 
decision making case was the task, it had certain weaknesses. Since the 
company was privately held, subjects received all available information from 
one source. In most strategic decisions, however, decision makers have to 
determine what information is needed and where to find it. In addition, 
those actually involved in the company would already have had certain 
industry, company, and task group experiences that subjects in this study 
did not begin with. 

A second limitation was the population of subjects used. These were not 
real-life strategic decision makers. However, the demographic data sug- 
gested that subjects had a reasonable degree of managerial experience. Also, 
subjects had studied a reasonable number of strategic concepts prior to 
working on the case. Even so, the task and the subjects suggest some caution 
in generalizing these findings to actual organizational strategic decisions and 
decision makers. 

A third limitation is the time frame of the study. The experiment ran for 
one week. Although managers may make many decisions this quickly, it is 
unlikely that comprehensive strategic decisions, such as were required here, 
would have been made in such a short time. Thus, the time frame may have 
affected the results. 

Fourth, the study did not include time as a performance variable. Groups 
had a fixed period within which to complete the task and had little incentive 
to finish more quickly. In practice, however, top management's time carries 
a high opportunity cost. Moreover, speed is a crucial consideration in some 
decisions. The best choice of decision making approach may depend on the 
expected value of incremental decision quality, the cost of additional man- 
agement work hours devoted to the decision, and the urgency of the situation. 

Finally, this study, like most group decision making studies conducted 
in laboratory settings, relies exclusively on ad hoc groups. Clearly, longitudi- 
nal studies of ongoing groups are needed. In particular, such studies may 
provide additional insight into the long-term effects of programmed group 
conflict on groups. One potential effect this study identified is the decline in 
group members’ willingness to work together after experiencing the conflict 
induced by dialectical inquiry or devil’s advocacy. A second contribution of 
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longitudinal studies could be to identify effects of learning on both individu- 
als and groups brought about by repeated exposure to these group decision 
approaches. Such learning could influence the relative effectiveness of these 
approaches over time. A final contribution of longitudinal studies would be 
to examine the effects that development of group norms and of shared 
information, perceptions, and values among members might have on sustain- 
ing the use of group conflict approaches (Beyer, 1981). 
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APPENDIX 


Manipulation Checks on Individuals and Items from Group 
Member Perception Questionnaire 


Six questionnaire items (two per manipulation) were used as manipulation checks. Group 
members evaluated the degree to which each item described what their group had done in 
arriving at its assumptions and recommendations. Responses were on the following 5-point 
scale: 1, strongly disagree; 2, disagree; 3, undecided; 4, agree; 5, strongly agree. 

Two items measured the dialectical inquiry manipulation: (1) the two subgroups consid- 
ered two conflicting sets of recommendations based on opposite sets of assumptions; and 
(2) on the basis of a debate over two sets of opposing assumptions developed by each subgroup, 
the group developed a pool of valid assumptions and based its recommendations on these as- 
sumptions. 

Two items measured the devil's advocacy manipulation: (1) one subgroup initially devel- 
oped a set of recommendations followed by a formal critique of it by a second subgroup; and 
(2) the group's final set of recommendations was based on the outcomes of a formal critique by 
one subgroup of the initial recommendations of another subgroup. 

Two items measured the consensus manipulation: (1) during the group meeting only, mem- 
bers of the group individually presented their personal recommendations and assumptions; and 
(2) the group's final set of recommendations was the combined, but not necessarily the aver- 
aged outcome of members' individual contributions. 

Alphas for the two-item dialectical inquiry, devil's advocacy, and consensus manipulation 
checks were .88, .84, and .63, respectively. 

Six additional items measured participants' reactions to their group experience. Group 
members evaluated the degree to which each statement described their feeling on the following 
5-point scale: 1, strongly disagree, 2, disagree; 3, undecided; 4, agree; 5, strongly agree. 

Two items measured satisfaction and desire to continue to work with the groups: (1) I 
would be willing to work with this group on other projects in the future; (2) working with my 
group was an enjoyable experience. 

Two items measured the extent to which the group condition led members to critically 

 reevaluate their own assumptions and recommendations: (1) the group decision process made 
me critically reevaluate the validity of the assumptions and recommendations that I made in my 
individual analysis; (2) the group decision process uncovered valid recommendations and 
assumptions that I did not consider in my individual analysis. 

Two items measured group decision acceptance: (1) I am committed to my group's recom- 
mendations and assumptions; (2) I am satisfied with my group's recommendations and 
assumptions. 

Alphas for the satisfaction, critical reevaluation, and acceptance measures were .80, .79, 
and .48, respectively. 
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ADMINISTRATIVE SUCCESSION 
AND ORGANIZATIONAL PERFORMANCE: 
HOW ADMINISTRATOR EXPERIENCE 
MEDIATES THE SUCCESSION EFFECT 


JEFFREY PFEFFER 
ALISON DAVIS-BLAKE 
Stanford University 


Analysis of'data for 22 teams in the National Basketball Association 
(NBA) indicated that succession has no effect on subsequent team per- 
formance when prior performance is controlled, a result consistent with 
those of other studies. However, when the competence of new coaches is 
included in the analysis, it apvears that succession affects subsequent 
performance. Teams in which succession occurred, and in which (1) the 
coach had a good prior record, (2) had prior experience in the NBA or 
ABA, or (3) had brought about performance improvements in other 
teams, performed better than teams whose new coaches either lacked 
experience or had performed less well in the past. The results suggest 
that attempts to estimate succession effects need to consider the compe- 
tence of successors. 


The effect of managerial replacement on organizational performance 
remains a debated issue in spite of a number of studies and renewed atten- 
tion to the problem (e.g., Allen, Panian, & Lotz, 1979; Brown, 1982). As Allen 
and his coauthors noted, there are “at least three theories about the relation- 
ship of managerial succession to organizational performance" (1979: 167). 
One theory claims that succession improves performance; a second asserts 
that succession disrupts performance; and a third maintains that succession 
has no effect on performance. Studies of the effects of managerial succession 
on performance have evolved over time. Recent studies have controlled for 
past performance and for effects of regression to the mean. They have consid- 
ered the differential effects of timing of succession—whether it occurs between 
seasons or during a season in the case of sports teams. They have also 
considered the differential effects of inside or outside succession (Allen et 
al., 1979). 

Research studying succession in organizations has had two major theo- 
retical thrusts. First, there has been interest in the effect of succession per se, 
in such questions as whether turnover negatively affects performance (Grusky, 
1963) and in the symbolic and ritualized meanings and forms of succession 
(Gephart, 1978). Second, scholars have assumed that studying succession 
will generate information on the vexing issue of whether leaders or managers 
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make a difference (Pfeffer, 1977; Smith, Carson, & Alexander, 1984). In one 
such study, Lieberson and O’Conner (1972) used succession events as vari- 
ables in partitioning variation in profits, sales, and profit margins to the 
sources of organization, time, and leadership. Salancik and Pfeffer (1977) 
looked at variations in budget allocations in cities in a similarly designed 
study. 

Prior to the recent study by Smith and his coauthors (1984), no analysis 
of succession events had addressed the experience or abilities of successors. 
However, examining successors’ abilities is necessary to fully explore both 
of the theoretical problems that have prompted research on succession. Many 
studies of the effects of succession have found that the event either had no 
effect or a very small effect on performance (Allen et al., 1979; Brown, 1982; 
Eitzen & Yetman, 1972). Explanations for lack of effects have often hinged on 
the abilities or experience of the successors. For example, March and March 
(1977) argued that there is often a filtering process in organizations that leads 
to selection from a fairly homogeneous group. If the individuals selected are 
fairly similar to those they succeed in terms of ability, strategy, and style, the 
effect of succession will be slight. Similarly, organizations that are perform- 
ing poorly and are therefore more likely to experience succession (Allen et 
al, 1979; Salancik & Pfeffer, 1980) may. not be able to attract the best 
replacements. In a competitive market for managerial talent, people who 
perform well are likely to be interested in going to organizations that do the 
same. Unless analysis explicitly includes successors' abilities, absence of a 
succession effect may be due to similarity in the abilities of the different 
managers or to a succession's having, in fact, no effect on performance. 

Similarly, when succession events are used to examine the effect of 
leaders on performance, it seems clear that who succeeds becomes as impor- 
tant as the nature of a succession event itself. Therefore, investigators should 
consider both the characteristics and timing of succession events and the 
backgrounds and abilities of successors. 

The purpose of this study was to examine the effect of successors' abilities 
on the results of succession in 22 National Basketball Association (NBA) 
teams. In the course of the analysis we also replicated previous findings of 
no succession effect for succession considered by itself. 


BACKGROUND 


There is little dispute that one cause of succession is poor organizational 
performance (Allen et al., 1979; Allen & Panian, 1982; Brown, 1982; Salancik 
& Pfeffer, 1980). What remains in question is the effect of succession on 
subsequent organizational performance. The early succession literature 
viewed succession as a disruptive event that depressed organizational per- 
formance. Gouldner (1954) observed that a change of managers in a gypsum 
plant disrupted operations and ultimately led to a wildcat strike. Grusky 
(1963, 1964) developed a vicious-circle theory of succession based on his 
study of baseball teams. He argued that teams are more likely to experience 
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turnover when they are performing poorly, but that once succession occurs its 
disruptive effects will tend to cause further deterioration in performance. 

The common sense view is that changing managers responsible for poor 
performance should make for better performance in the future. Although 
there is some case study data supporting the positive effects of succession 
(Guest, 1962), very little other empirical support exists. Rather, the most 
common finding in the literature is that succession has no effect on 
performance. ‘In a study of college basketball teams, Eitzen and Yetman 
(1972) found that once they controlled for previous levels of team success, a 
change in coaches made no difference in performance. Allen and his col- 
leagues found a succession effect on the performance of major league base- 
ball teams, and further noted that “succession between seasons is more 
likely to improve performance than succession during a season ... inside 
succession is less likely to disrupt team performance than outside succession” 
(1979: 178). However, the size of the effects were small, so that they 
concluded that "a change in managers typically has only a small impact on 
subsequent performance" (1979: 179). Brown also concluded that “there is 
little in the way of a succession effect among professional football organiza- 
tions" (1982: 13). In a study of succession in a sample of Methodist churches, 
Smith, Carson, and Alexander (1984: 772) reported no succession effect for 
the dependent variables of attendance, membership, property value, total 
giving, or operating budget; this result held for both one-year and three-year 
lags. 

There are several possible explanations for findings of no succession 
effects or only small effects. One possibility is that two simultaneous forces— 
a positive effect from the replacement of an unsuccessful manager and a 
negative effect resulting from the disruption occasioned by the succession— 
may cancel each other out. Another possibility is that succession is an 
example of ritual scapegoating (Gamson & Scotch, 1964) and that for a variety 
of reasons managers do not matter much (Pfeffer, 1977). A third possibility 
is that managers do matter (Weiner & Mahoney, 1981), but that in order to 
observe a managerial effect, investigators need to consider the characteristics 
or abilities of managers. Much of the time the kind of filtering process dis- 
cussed by March and March (1977) and Pfeffer (1977) may work to insure that 
new managers are similar to those being replaced. In such circumstances, 
little if any succession effect is likely to occur. Replacement could be ex- 
pected to have an effect only in those instances in which new managers differ 
from the old ones. Moreover, the direction of the effect should be related to 
successors' abilities. 

Smith and his colleagues (1984) investigated the effects of managerial 
ability on organizational performance following succession. Several charac- 
teristics of their study suggest that further investigation of this issue would 
be useful. They studied the careers of 50 Methodist ministers, computing a 
salary equation regressing current salary on years of service in the ministry 
for each individual. Ministers who received salaries more than one standard 
deviation above the mean eight or more times were labeled as effective; 7 of 
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the 50 were so identified. Then, they conducted an analysis regressing vari- 
ous performance indicators such as attendance and giving in ministers’ last 
years with churches on the values of those indicators in the ministers’ first 
years of service; a zero-one variable indicated whether or not ministers 
were high-performing. They inferred the effect of leadership ability on perfor- 
mance from the statistically significant effects of this dummy variable in the 
regressions. 

The study (Smith et al., 1984) raised many issues. First, ministers, not 
churches, were the units of analysis. Instead of following a given church over 
time and examining the effects of performance of ministers of varying ability, 
Smith and colleagues sampled ministers and followed them across churches. 
Assignment of ministers to churches is unlikely to be random, and the 
analysis reported in the article did not allow separation of the effects of the 
assignment process from the effects of the succession event itself. Also, the 
analysis did not test for effects of regression to the mean. Moreover, is salary 
the best predictor of ability, particularly in a labor market that is not likely 
to follow many of the dictates of perfect competition? This is especially 
problematic because the equation used to estimate the ministers’ effective- 
ness did not control for congregational size or resources and thus possibly 
confounded these variables with effectiveness. Conceptually, it would have 
been possible to estimate leaders’ performance levels from their past success 
in improving each of the dimensions of organizational performance and then 
to use this estimate as a subsequent control or indicator of leader effective- 
ness. Finally, the use of only extreme values of leader effectiveness seems to 
limit the explanatory power of the study. If leader ability matters, it is proba- 
bly a more or less continuous variable and might be measured as such instead 
of with a zero-one value. These limitations of Smith and his colleagues’ 
study and the importance of the leadership/succession issue suggest a need 
for further empirical research. 

The current research thus both replicates and extends past research on 
succession. Most past research has tended to treat all managers as equal and 
has ignored variation in characteristics that might be related to their ability to 
affect organizational performance. It seems reasonable to further investigate 
the possibility that all managers—or, in the case of sports teams, all coaches—do 
not have equal ability and that those with superior ability might be better 
able to generate improvement in performance. Failure to account for the 
ability of a successor confounds that ability with the succession event itself. 
The present research both examines the effects of succession and investi- 
gates the effects of variation in ability on the consequences of succession. 
Further, this study attempted to overcome some of the limitations of Smith 
and colleagues’ study by examining several organizations over time and by 
directly measuring past performance of new managers. 


DATA AND METHODS 


This analysis used pooled time series and cross-sectional data for all of 
the teams in the NBA for the 1977 through 1981 seasons. The sample included 
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no teams that entered the NBA after 1977. Twenty-two teams were in the 
NBA for all of the five seasons, resulting in a total of 110 observations. 
Succession within or between seasons occurred in 27 of the 110 instances 
studied. We obtained data from the Sporting News Official NBA Guide for 
each of the seasons included in the sample. Analyzing pooled cross-sectional 
and time series data involves problems with autocorrelation that make the 
interpretation of ordinary least squares regression difficult (Hannan & Young, 
1977). Therefore, most of the analyses reported here derive from generalized 
least-squares (GLS) techniques. The particular GLS procedure employed in 
this analysis is difficult to use with unbalanced designs—for example, with- 
out the same numbers of observations or cases for each year. This restricted 
the analysis to these five years if we wanted to use the 22 teams. Alternatively, 
we could have used a smaller sample extending back further in time, but 
there is no reason to expect that there would have been any difference in the 
results. 

Another issue is our choosing to study basketball teams. Several factors 
motivated the choice of this particular type of organization.! First, examining 
the effects of succession and the abilities of successors demands a setting in 
which succession and organizational and managerial performance are both 
readily measured. As Lenz (1981) noted, there are a number of problems and 
difficulties in studying the performance of business firms. Also, selecting a 
measure of managerial performance is extremely difficult because it is not 
clear what the measure should be and because many performance measures 
are either distorted or unavailable. In sports teams, performance and succes- 
sion measures are readily available and relatively easy to interpret. Although 
future studies of succession should be conducted in settings in which perfor- 
mance is less easily measured, these studies will need to carefully consider 
selection of appropriate performance measures. In particular, these studies 
will need to insure that measures of managerial performance are responsive 
to managerial effort and are not determined primarily by forces outside 
managerial control. Second, because this research atterapted to extend an idea 
developed in the study of sports teams, sports teams seemed to be the most 
appropriate units on which to test our hypotheses. If a different type of or- 
ganization were used and the extension of the original ideas was not con- 
firmed, it would be difficult to determine whether our hypotheses or dif- 
ferences in the types of organizations in which they were tested were re- 
sponsible. Attempting to extend the ideas in their original context elimi- 
nates this potential problem in interpreting the results. 

As Brown (1982) noted, baseball is very much an individual sport in 
which strategy and coaching probably have only a slight effect on perfor- 
mance. Although football teams are, in many respects, more like small busi- 
ness organizations than baseball teams are, their larger coaching staffs and 
player rosters and shorter seasons mean that random variations in perfor- 
mance can have significant effects on win-loss percentages. Basketball, a 
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team sport with essentially one coach and many games played during a sea- 
son, appears to be a useful starting point for the analysis. Thus, we chose 
to study the 22 professional basketball teams in the NBA. 

The measure of team performance we used was the percentage of games 
won during the regular season. This is the same measure employed in all 
previous studies of succession in sports teams of any type. We rejected other 
indicators of team performance like numbers of assists, turnovers, or other 
more process-related variables for two reasons. First, as Scott (1977) indicated, 
outcome measures, if and when they are available and can be unambigu- 
ously defined, are better than process or structure measures because they get 
at the concept of effective performance more directly. One of the advantages 
of using sports teams in these analyses is that there is no problem of multiple 
constituencies wanting different outcomes from an organization. Sports teams 
have a clear measure of success—their win-loss records. Second, a focus on 
process requires investigators to develop a theory of how such processes are 
related to outcomes, and developing a theory of basketball is beyond the 
scope of this paper. Suffice it to note that the relationships between process 
and outcome measures are not straightforward. For instance, in our sample, 
the number of turnovers was correlated only .01 with the teams’ win-loss 
percentages. 

Following Brown (1982) who observed a large effect for the number of 
new players, we also included this variable in the analysis. New players 
were all players on a team’s roster during a given season who had not been 
on the roster during the preceding season. 

The ability of new coaches was measured in three ways. First, we mea- 
sured cumulative win-loss records or percentages of games won prior to suc- 
cession for all coaches who had previously coached in either the NBA or 
the ABA (American Basketball Association). This record included all partial 
as well as complete seasons coached; those with no previous experience re- 
ceived a value of zero for this variable. Of course, coaches are not totally 
responsible for their previous records as the ability of the players they have 
coached and the willingness of owners to spend money to attract good players 
are partly outside their control. Nevertheless, we presume an individual’s 
past record to be a reasonable proxy for ability. Again, one of the advantages 
of studying sports teams is that the measures of ability are clearer and 
cleaner than in other settings. 

Second, a dummy variable indicating whether coaches had previously 
coached professional basketball, either NBA or ABA, was used as a measure 
of ability gained through experience. 

Third, in order to (1) account for the fact that coaches’ win-loss records are 
not completely under their control and (2) to better isolate coaches’ effects on 
team performance, we measured coaches’ ability as the amount of improve- 
ment or decline in performance that occurred in teams coached prior to a 
succession event. Looking only at performance improvements or declines 
controls for relatively stable team characteristics such as owner resources and 
player ability and is therefore a somewhat clearer indicator of the ability of 
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coaches to affect performance than previous win-loss records alone. For a 
given coach, performance improvements or declines were measured in two 
ways: (1) the percentage of seasons in which the win-loss record of the coach’s 
team was better than that team’s average win-loss record under its previous 
coach; and (2) the actual increase or decrease in win-loss record over the 
average win-loss record achieved by the previous coach. We averaged both 
measures over coaches’ entire professional careers up to and including the 
seasons prior to successions. In a few cases, we could not compute these 
measures because coaches’ first jobs had been with newly created teams or 
because their first jobs started before 1969 and lasted until 1977 or later and 
information about prior coaches was not available. In these cases, both mea- 
sures received values of zero. 

A dummy variable indicating succession was coded 1 if succession 
occurred either during the current season or between the prior and current 
seasons. To test the effects of coaches’ records on succession, we created four 
interaction terms. The first multiplied the succession dummy variable by a 
coach’s previous record; the second multiplied the succession dummy vari- 
able by the dummy variable assessing whether or not a coach had previous 
experience in professional basketball; the third multiplied the succession 
dummy by the percentage of seasons a coach produced performance im- 
provements; and the fourth multiplied the succession dummy by the average 
change in win-loss record produced by a coach. If succession had not oc- 
curred, neither a coach’s record nor experience was relevant or included in 
the analysis, since each would be multiplied by zero. The effect of prior ex- 
perience or past record entered only in instances of succession. 


RESULTS 


Table 1 presents means, standard deviations, and a correlation matrix 
for all the variables. As the data combine time-series and cross-sectional ob- 
servations, tests of significance for the correlations are difficult to interpret 
and are not reported (Hannan & Young, 1977). Table 1 shows that performance 
in the past season (r=.52) and the number of new players (r=-.59) have the 
strongest effects on performance in a current season. These results parallel 
those of Brown (1982) and of Allen and his colleagues (1979). Table 1 also 
reveals a positive effect for a coach's past record on performance (r= .26) and 
a small positive effect for a coach's having had experience in professional 
basketball (r= .045). 

Table 1 indicates something about the conditions under which succes- 
sion is most likely to occur. Teams that had poor records in the season just 
past are more likely to experience succession than teams that had good 
records (r=-.305), and they are also more likely to experience turnover on the 
roster (r —-.45). This replicates Brown's (1982) finding for professional football 
teams. The pattern of correlations also indicates why succession may appear 
to lead to a vicious cycle of performance decline. If succession and player 
turnover tend to occur together (r= .26), and player turnover leads to perfor- 
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TABLE 1 
Means, Standard Deviations, and Correlations for All Variables 








Variables Means s.d. 2 3 4 52 62 7 8 

1. Percent of games won 

by team, current : 

season .505  .132 .524 .257 .045 -.059  —.006 -—.591 —.105 
2. Percent of games won 

by team, past season 503 .122 419 100 —.244 ~.110 -.451 ~.305 
3. Coach ability, past 

record 436 8.182 844 .077 .097  —.239 ~.551 
4. Coach ability, previous : 

professional coaching 91 .313 .100 052 —.080 ~.614 


5. Coach ability, percent 

of seasons improved 

over past coach® .068 .239 .732 207  .501 
6. Coach ability, change 

in win/loss record 


over past coach? .009 ` .053 236 .262 
7. Number of new 
players 6.727 3.013 256 


8. Succession between 
current and past sea- 
son or within current 
season? 245 432 





a Correlations with these variables should be interpreted with caution; see text for explana- 
tion. 
bi = yes 


mance decline (r= -.59), then observers may partly misattribute decrements in 
performance resulting from player turnovers to succession events. If decline 
in performance produces both further successions and further changes in a 
roster, performance may continue to decline even if succession itself has had 
no effect on it. 

It is important to note that some of the relationships reported in Table 1 
should be interpreted with extreme caution because the indicators of coach 
ability were not measured identically for teams that did not experience 
succession. In particular, we measured coaches’ win-loss records and previ- 
ous professional experience for all teams, whether or not they experienced 
succession. Therefore, these two measures of ability provide a relatively 
good indicator of the effect of ability on performance, regardless of whether 
or not succession occurs. However, because the measures of ability that track 
improvements in performance required locating data about coaches who were 
not in the sample of interest, we only calculated these measures for those 
teams in which succession actually occurred. Where succession did not 
occur, we coded these measures as zero. Therefore, because they are con- 
founded with succession, these measures should not be used to look at the 
overall relationship between ability and performance. 
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TABLE 2 
GLS Estimates of the Effect of Succession on Performance? 
Equations 
Variables 1 2 3 4 5 
Performance in past Ag tt 318**** .363**** .241*** .310**** 
season (2.57) (5.31) (8.47) (2.68) (10.38) 
Number of new players —.020**** —.020**** = —.020****  Á—.021**** — —.021**** 
(5.72) (16.54) (14.36) (5.92) (38.59) 
Succession .038 — — — TN 
(1.69) 
Succession X coach's — .093**** — — — 
past record (3.88) 
Succession X previous — — .050**** — — 
professional coaching (5.64) 
Succession X percent of — — — .087** — 
seasons improved (2.16) 
over past coach 
Succession X changein — — — — .546 **** 
win/loss record over (4.81) 
past coach 
Intercept .509 .460 .428 .516 .481 
R? .436 444 447 451 .437 





* Deperident variable is performance in the current season. Unstandardized regression co- 
efficients are reported and t-values appear in parentheses. 
*p<.10 
**p«.05 
***p«.01 
eee ED < 001 


It is also important to remember that measuring ability only for succes- 
sion events has no effect on the subsequent analyses that we report. As 
noted earlier in this paper, we multiplied ability by the succession dummy 
variable, which was not zero only when succession had actually occurred. 

Table 2 presents the results of five multiple regression equations.? The 
first equation included only the analysis for effects of succession, without 
reference to either experience or the past records of new coaches. The equa- 
tion indicates that both past performance and player turnover have significant 


? An alternative method for estimating the equations is to use ordinary least squares and to 
include dummy variables for each team (least squares with dummy variables). When this ap- 
proach was used for the analysis reported here, the results were essentially the same as those 
reported in Table 2 with the following three exceptions. First, performance in the past season 
had no effect on performance in the current season in any of the equations. This result is expected 
because the effects of the lagged dependent variable are essentially captured by the dummy 
variables for each team. The dummy variables for Chicago, Cleveland, Detroit, San Diego, and 
Utah were significant and negative in most of the equations. The variables for Los Angeles 
and Philadelphia were significant and positive in all of the equations. All of the other dummy 
variables were not significant in any of the equations. Washington was the omitted category. 
Second, the effects of the four interaction terms are somewhat weaker, although all are still 
positive and most are still significant. Third, R? increases by about .10 in all of the equations. 
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effects on current performance, but there is no significant effect for succession. 
The absence of a succession effect replicates the predominant finding in the 
literature. 

The next four columns indicate that the quality of new coaches has a 
significant effect on performance. Equation 2 estimates the effect of succes- 
sion on performance with the past record of the coaches considered; equa- 
tion 3 takes into account the coaches’ previous professional experience; and 
equations 4 and 5 consider improvements or declines in past performance 
brought about by coaches. In all instances, the interaction term is statisti- 
cally significant and in the expected direction, indicating that (1) coaches 
with better prior win-loss records, (2) coaches with previous experience, and 
(3) coaches who have improved the performance of other teams are all associ- 
ated with better team performance after succession. Because only those 
coaches with previous experience have records to be entered into the equation, 
there is too much collinearity between the various interaction terms to make 
it sensible to enter more than one at a time. Thus, although we cannot 
estimate which of the effects is predominant, we can conclude that the 
ability and experience of a coach have an effect on a team’s performance after 
succession. 


DISCUSSION 


It seems increasingly clear that succession is a complex phenomenon 
with multiple effects depending on timing, the choice of inside or outside 
successors, and the abilities of successors. Previous research has indicated, 
for instance, that outside succession is likely to be associated with more 
organizational change than inside succession (Carlson, 1962; Helmich & 
Brown, 1972). In their study of baseball teams, Allen and his colleagues 
(1979) found some evidence for midseason succession being more disruptive 
than end-of-season succession and for inside succession being less disrup- 
tive than outside succession. Results of the present study indicate that, as 
common sense might suggest, the ability of a successor affects a team’s 
performance after the succession. 

The findings of both previous and the present research suggest that 
merely knowing that succession has occurred is not an adequate basis for 
predicting its consequences. Rather, the type of succession, when it occurs, 
and particularly, the abilities of replacement managers all need to be ac- 
counted for in assessing likely effects of succession. In other words, it is 
probably not fruitful to try to find a single succession effect because the con- 
sequences of successions are likely to vary dramatically depending on the 
conditions surrounding them. This research replicated the finding that 
merely knowing that succession has occurred does not increase ability to 
predict subsequent organizational functioning. Results also confirmed the 
hypothesis that the ability of anew manager is in fact an important predictor 
of organizational performance after succession. Thus, consistent with pre- 
vious findings (Smith et al., 1984), this research suggests that it is important 
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for organizations to select individuals with the best performance records from 
pools of available replacements. 

This research also helps to disentangle the two theoretical problems 
mentioned in the beginning of this article that are associated with succession. 
Some past researchers who have studied the effects of succession events 
themselves have suggested that finding no effect of succession indicates 
that leaders cannot significantly affect organizational performance. This 
research indicates that the question of whether leaders affect organizational 
performance cannot be answered by merely looking at organizational perfor- 
mance before and after succession. Instead, investigators should examine 
conditions surrounding successions and the characteristics of successors, par- 
ticularly ability and experience. By examining these aspects of succession, 
future research may be able to identify types of succession situations that 
allow leaders to have substantial influence on organizational outcomes and 
types of succession situations in which leader influence is more limited. 
Similarly, future research may be able to identify characteristics in addition 
to ability and experience that affect leaders’ abilities to influence organi- 
zational outcomes. 
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This study compares results obtained from six different job evaluation 
methods in terms of (1) their effects on job classification decisions and 
(2) inferences regarding the equity of job hierarchies. Evaluations of 206 
state-classified jobs indicated that decisions about job hierarchies and 
estimates of the extent of gender-based discrimination in classifying 
jobs are dependent on the methods used. 


The issue of pay equity for women has become a major social and eco- 
nomic issue in our society. A wide gap between the median earnings of men 
and women has persisted in the face of profound changes in women’s partici- 
pation in the labor force (Norwood, 1982). This has fueled a national debate 
over pay discrimination that was heightened considerably by the conclu- 
sions of a National Academy of Sciences (NAS) panel that various aspects of 
wage determination, particularly job evaluation procedures, are susceptible 
to gender based discrimination (Treiman & Hartman, 1981). The NAS report 
also suggested that modified job evaluation procedures could provide stan- 
dards of relative job worth that might uncover or reduce discrimination. 
Such standards would go beyond the historical rule of equal pay for equal 
work by mandating equal pay for dissimilar jobs of comparable worth as 
measured by internal or nonmarket criteria of job value. Advocates of the 
comparable worth principle of pay determination argue that it offers a means 
of significantly and immediately reducing the pay gap between men and 
women. If this principle were public policy, many more situations in which 
there is pay disparity would become illegal. 

Although arguments against adopting the comparable worth principle 
are formidable (Livernash, 1980; Lorber, Kirk, Samuels, & Spellman, 1985; 
Nelson, 1980), support for it is burgeoning. State and local governments have 
launched more than a 100 different comparable worth initiatives (Mayer, 
1984), including a 1984 Minnesota statute mandating implementation of the 
comparable worth principle in each political subdivision in the state. 
Interestingly, few of the initiatives specify the criteria of job worth or the 
measurement procedures to be used in establishing relative job worth. This 


The authors would like to thank H. Patricia Curran from the Michigan Department of Labor, 
Office of Women and Work for her support; thanks also to the anonymous reviewers for their 
many helpful comments and suggestions. 
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omission has potentially significant consequences. Because universal norms 
of worth do not exist (Mahoney, 1983), the concept of worth incorporated in 
different job evaluation plans is likely to vary. In addition, the psychometric 
qualities of evaluation plans vary (Madigan, 1985). Consequently, relative 
job worth is likely to be a function of the method of job evaluation used; this 
likelihood has important legal and financial implications as well as implica- 
tions for employee-management relations. 

H decisions on pay classification are a function of the method used to 
evaluate jobs, it follows that inferences regarding pay discrimination also 
depend on method. In analyses of pay discrimination, job evaluation proce- 
dures define variables for job worth or job level. The studies of pay equity or 
comparable worth that most states have conducted or considered are particu- 
larly susceptible to method differences because job worth is their sole crite- 
rion of the fairness of pay systems. A recent court decision, American Nurses 
Association v. Illinois (1985), explicitly recognized the possibility that differ- 
ent job evaluation methods could produce different results. This decision 
disallowed a job evaluation study as the basis for a pay discrimination action. 
On the other hand, in the much publicized case of AFSCME v. State of 
Washington (1983), a simple regression analysis in which pay rates and 
evaluation points were entered was critical to the union's case. In effect, the 
court rejected the state of Washington's definition of worth in favor of a 
consultant's job evaluation study. It is interesting to speculate how the Wash- 
ington court would have ruled had advocates submitted competing mea- 
sures of job worth for consideration. 

The research literature on correspondence among job evaluation plans is 
sparse and subject to differing interpretations. Snelgar (1983) reviewed five 
studies and concluded that different methods produce highly similar results. 
Schwab (1980) emphasized divergence in results on the basis of his review 
of four studies, two of which Snelgar had cited. It is noteworthy that none of 
the reported studies assessed the extent of correspondence between methods 
in terms of decisions about job classifications. Rather, correlation coefficients 
provided their indices of.agreement, an approach that can mask considerable 
divergence in actual classification decisions that are based on different meth- 
ods of evaluating jobs. 

Hence, the purpose of this study was to investigate two related questions: 
(1) To what extent do pay structures vary by job evaluation method? (2) Are 
the results of studies of pay equity or comparable worth sensitive to differ- 
ences in job evaluation methods? 


METHODS 


The data analyzed here were collected for an earlier study of pay equity 
in the state of Michigan civil service job classification structures; the senior 
author served as a member of the technical advisory committee for that 
study. The primary purposes of the original study were (1) to assess the 
feasibility of developing and using an approach that would evaluate all types 
of jobs within the state of Michigan on a common scale of value; and (2) to 
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- analyze the extent to which jobs dominated by members of one gender 
received equal pay for equal value as measured by job evaluation. In 
recognition of the potentially discriminatory aspects of conventional job 
evaluation procedures (Treiman, 1979), project specifications required 
controls for possible gender effects in both job analyses and in the develop- 
ment and application of the job evaluation plan. 

The earlier study was conducted during 1980 and 1981 by a management 
consulting firm under the direction of the Office of Women and Work, 
Michigan Department of Labor. The data collection procedures described 
below summarize relevant portions of the project report. This study appears 
to have been unique among the many recent public sector analyses of pay 
equity in that investigators applied two distinct methods of job evaluation to 
a job sample. However, the study did not directly address the issue of 
differences among job evaluation methods. Hence, for the present study we 
developed alternative procedures for scoring job worth and analyzed data 
from the original study again. 


Job Sample 


Jobs selected for the original study were drawn from 206 job classifications 
in seven service groups in the state's classification system. The groups were 
engineering and science; law enforcement; labor and trades; clerical support; 
public safety, security, and regulatory work; domestic work; and human 
service. The study excluded the legal, executive/administrative, and physician 
groups. Job classes were selected to ensure that the sample (1) represented 
the seven service groups and their full ranges of job complexity, and (2) 
contained jobs that represented groups' gender composition fully enough to 
permit statistical analysis. Classes of jobs that had between 30 percent and 
70 percent men or women were designated as mixed classes; those with over 
70 percent of one gender were defined as dominated by men or women. The 
206 classes in the study took in approximately 25 percent of the total active 
job classes in the seven service groups. The percent of job classes sampled 
within groups ranged from 22 to 48 percent. The imbalance in gender 
composition among service groups necessitated these intergroup sampling 
differences. Over half the job classes consisted of jobs held mainly by men. 
In addition, most job classes were dominated by one or the other gender. In 
the final sample, men dominated 69 classes, women dominated 84, and the 
remaining classes were mixed. 


Job Analysis 


Eight job analysts were hired for the original state of Michigan project. 
All analysts were currently enrolled in graduate programs, or had been 
enrolled recently, and all had completed at least master's degrees in 
industrial/organizational psychology or industrial relations. Analyst training 
consisted of a two-day program covering interview procedures, use of a 
patterned interview guide, job evaluation procedures, and project documen- 
tation requirements. During the job evaluation phase of the training, analysts 
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specialized in using one of the two rating instruments, the Position Analysis 
Questionnaire (PAQ) (McCormick, 1979) or a traditional point plan, both of 
which are discussed later in this paper. Analyst teams with one man and one 
woman were then formed to control for possible gender-related variance in 
perceptions of job characteristics (Arvey, Passino, & Lounsbury, 1977). 

Job analysis procedures were designed to ensure that job information 
would be identical for both evaluation methods. Analysts collected job 
information by reviewing descriptions of both generic classifications and 
individual positions. Job incumbents were then interviewed. One interview 
was conducted for each of the 206 classes. A typical interview included four 
analysts—one PAQ and one traditional team—and from two to eight jobholders 
with at least six months tenure in the same class. The number of people in a 
class and the apparent diversity of assignments within it determined the 
number of interviewees per class. Analysts tape-recorded all sessions to 
provide complete records of the interviews for later reference and to serve as 
primary input when one team could not be present. 

Interviews followed a semistructured procedure. An interview guide 
developed by the consultants specified the content and sequence of infor- 
mation collection to ensure that interviewers would base job ratings on 
comprehensive, job-related information.! The guide specified certain key 
questions critical to accurate completion of the evaluation instruments. Within 
these constraints, interviewers developed their own questions and lines of 
inquiry. They took notes during interviews and were instructed to carefully 
document their observations at the conclusion of each session. 

Data collection procedures differed from typical job evaluation practices 
in that analysts did not observe actual work situations. Moreover, they were 
less knowledgeable than internal analysts would be about all the types of 
work in the service groups. Thus, the job information collected could have 
been deficient or misevaluated to some degree. These shortcomings would 
be serious for an analysis of equity in pay structures if the objective were to 
establish a job hierarchy that accurately reflects differences in job content. 
However, the present study did not address the question of equity in the 
Michigan classification system. Rather, its focus was correspondence between 
evaluation methods. The critical consideration here was whether job 
information was the same for all methods. The original study's data collection 
procedures were appropriate to this objective. 


Job Ratings 


Analysts completed evaluation instruments as soon as possible after 
conducting interviews. Analysts were instructed to refer to their tape 
recordings and notes prior to completing ratings if delays occurred. Each 
analyst was asked to work independently in order to facilitate reliability 
checks. They submitted the four completed job evaluation instruments, 


"The interview guide and other instruments are available from the project archives main- 
tained by the Office of Women and Work, Michigan Department of Labor. 
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interview notes, and tape recordings to the project staff when an evaluation 
was completed. 

Analysts’ efforts underwent a number of quality checks. Project consul- 
tants closely monitored the early stages of data collection and completed third 
sets of ratings on 50 jobs to provide an additional means of auditing analysts’ 
accuracy and reliablity. They also reviewed an early sample of each analyst’s 
job evaluations for evidence of range restriction and leniency or severity.” 
When job evaluations were completed, project staff checked the instruments 
submitted by each pair of analysts for discrepancies in item scores of more 
than one level. Items with such descrepancies went back to the analysts for 
comparison with notes and tape recordings. As a final step toward high 
measurement quality, the two ratings on each traditional factor and PAQ 
item were averaged to provide a single set of ratings for each job on each 
instrument. Hence, the two basic sets of job ratings used in the study consisted 
of (1) the mean PAQ item scores, and (2) the mean, unweighted, traditional- 
plan factor-level scores. 


Instruments 


Traditional plan. The traditional plan was a modified version of the 
primary standards of the federal government Factor Evaluation System (FES), a 
method designed to apply to a broad range of nonsupervisory positions. The 
FES method rates each job on a set of nine job worth or compensable factors 
of the type common to traditional point-factor plans. These include knowledge 
required by a job, its physical demands, complexity, work environment, and 
so forth. Compensable factors receive point values that reflect their relative 
importance to a job’s worth or value as defined by an organization. The sum 
of the ratings on each factor constitutes total job worth, and jobs are ordered 
in the pay structure according to these total scores.? 

The FES plan primary standards were modified in three ways. First, the 
project team added factors to evaluate supervisory responsibilities and char- 
acteristics related to job stress. Second, FES scale definitions and examples 
were revised to make them more relevant to the job sample. Finally, a new 
factor-weighting scheme was developed. These changes reflected the judgment 
of an advisory panel consisting of representatives of unions, civil service, the 
Equal Employment Opportunity Commission, and women's rights groups. In 
the opinion of this broadly representative group, the final set of 11 factors 
was an appropriate and nonbiased set of job worth criteria. This process of 
developing a job evaluation method followed suggestions of the NAS and 
other critics of contemporary job evaluation plans (Remick, 1984). 

Position Analysis Questionnaire. The PAQ is a 187-item quantitative job 
analysis instrument designed to encompass every human behavior that 
might arise on the job (McCormick, 1979). Item scores are combined to 


? Results of these early reviews for discrepancies and problems were not available. 
3 See Henderson (1985) or Treiman (1979) for a more complete discussion of the FES plan and 
construction of point plans. 
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produce dimension scores (McCormick, Jeanneret, & Mecham, 1977) that 
can be treated as compensable factors in job evaluation applications. 
Dimension scores are converted to measures of job worth by using multiple 
regression to find the combination of weighted dimension scores that most 
accurately captures a criterion structure, which is typically the wage rates 
for benchmark jobs. The resulting least-squares line becomes a policy line 
for setting wage rates, and observed dimension scores for all jobs are then 
entered into the equation to obtain job values. This policy-capturing 
approach to job evaluation is not unique to the PAQ. Historically, most 
evaluation plans have mirrored predetermined job hierarchies, and statis- 
tical weighting has been used extensively for over 35 years.* The PAQ differs 
from past approaches to job evaluation primarily in the number and 
specificity of its questionnaire items and in its direct application of a 
regression equation to evaluate job worth. Obviously, to the degree a criterion 
structure reflects bias, this evaluation procedure captures such bias. In the 
present study, two different criterion structures were used to develop PAQ- 
based equations for job worth. These are discussed in the next section. 


Scoring Procedures 


The study used six methods of developing relative worth scores from the 
two basic sets of job ratings. Methods 1 and 2 used subjective weights; methods 
3 and 4 were statistically weighted versions of traditional plan ratings; and 
methods 5 and 6 were the two PAQ equations. The six scoring procedures 
were as follows: 

Method 1. The traditional plan factors and scale levels received point 
values in accordance with both the factor weights determined by the advisory 
panel and with normal procedures for point plan development (Henderson, 
1985). Because Factors 8 through 11 consisted of two subfactors, this 
procedure yielded a total of 15 factor scores for each job. We will refer to 
these factor scores as scale scores. The sum of scale scores was determined 
for each of the 206 jobs to establish its total job worth and relative position in 
the job hierarchy. Table 1 presents the factor weights for methods 1, 3, and 4. 
Method 2 weights were the same as those of method 1 for the selected 
factors. 

Method 2. A simplified version of method 1 was developed in which 
only four of the original scales were used. Research results have often 
indicated that two to four factors dominate point evaluation plans (Lawshe, 
1945; Lawshe, Dudek, & Wilson, 1948; Madigan, 1985). The traditional plan 
was no exception. Factor analysis revealed a three-factor solution explaining 
93 percent of common variance in unweighted scale level ratings, or a four- 
factor solution explaining 99 percent. We developed a four-factor simplified 
plan with four scales from the traditional plan selected as proxies for the 
underlying factors primarily on the basis of their factor loadings (Table 2). 


* See, for example, Stieber's (1959) discussion of the origins and development of the steel 
industry's job evaluation plan. 
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TABLE 1 
Factor Weight Allocations for Evaluation Methods 1, 3, and 4 
Traditional Plan Factors Method 1 Method 3 Method 4 
Knowledge required 11.2% 30% 7% 
Supervisory controls 9.8 T 5 
Performance guidelines 9.2 5 3 
Complexity 10.0 2 11 
Output effect 10.9 1 g 
Personal contacts 7. 5 11 
Purpose of contacts 8.7 1 1 
Physical demands 
Abilities 4.0 1 1 
Exertion 4.4 1 17 
Work environment 
Risk 4.5 3 7 
Working conditions 3.9 2 11 
Stress conditions 
Workpace 3.5 5 5 
Interruptions 4.0 3 1 
Supervisory responsibilities 
Degree of responsibilities 5.4 25 10 
Number supervised 3.5 15 1 
100.0% 100% 100% 
TABLE 2 
Oblique Factor Pattern Matrix for Traditional Plan Scale Scores 
Scale Scores Factor 1 Factor 3 Factor 3 Factor 4 
Knowledge required 86 .06 .00 Dé 
Supervisory controls 71 .09 .26 Dé 
Performance guidelines .74 .06 .09 .00 
Complexity .80 .04 10 .07 
Output effect 7B .08 —.06 .09 
Personal contacts KO —.14 ~.09 14 
Purpose of contacts .Bá 12 —.05. —.01 
Physical abilities —.24 42 — 10 17 
Physical exertion —.01 -76 .00 —.02 
Work risk 12 89 05 01 
Working conditions .10 Kéi 04 —.11 
Workpace 19 .00 .03 .66 
Interruptions 10 .08 44 725 
Supervisory responsibilities .03 .02 .96 .01 
Number supervised .06 .01 95 .08 
Eigenvalue? 6.25 2.18 1.28 71 
Percent variance 41.70 14.50 8.60 4.70 


8 Initial factors. 
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More elegant approaches to developing factor scores could have been used; 
however, the objective here was not to optimally measure an underlying 
commonality but to create a simple plan similar to those that are in use. 
Ratings from the first seven scales all loaded heavily on the first common 
factor. The first scale, knowledge requirements, was selected to represent 
this factor. In a similar fashion, the scale for working conditions was chosen 
for Factor 2, supervisory responsibilities represented Factor 3, and work 
place interruptions represented Factor 4. 

Method 3. A statistically weighted or policy-capturing version of the 
traditional plan was developed with. the maximum of the pay range for 
mixed job classes as the criterion structure. One of the criticisms of factor 
weighting procedures has been that the benchmark jobs used to determine 
weights are typically jobs dominated by men, which relegates factors 
associated with jobs held predominantly by women to minor importance 
(Treiman, 1979). Method 3 is one possible response to this criticism; the 
rates assigned to the mixed classifications are assumed to represent a fair or 
at least gender-blind pay policy that the job evaluation model captures. This 
assumption is acceptable here, but an actual equity analysis would need to 
establish that a given criterion structure represented the entire domain of 
work in a job structure. 

Researchers for the original study chose to use maximum pay ranges 
rather than midpoints because many employees were at or near the maxi- 
mums. Thus, the analysis of equity in the classification structure also served 
as a rough indicator of equity in actual pay distribution. 

Factor weights in method 3 were based on beta weights from a multiple 
regression equation in which maximum pay and unweighted traditional 
plan scale scores were entered.? These weights differed from the subjectively 
determined weights used in method 1. In method 1 fair weights were subjec- 
tively determined via a consensus process, whereas in method 3, job ratings 
were used to statistically model a fair criterion structure and factor weights 


5 Statistically assisted weighting oi point factor plans is an indirect type of policy-capturing 
procedure. Beta weights are used to help allocate values to the factors and degree levels, rather 
than for weighting of observed scores as in the PAQ procedure. In point plans, the maximum 
point values assigned to each factor reflect the relative importance of factors (Sargent, 1972). 
The weighting task involves determining a point allocation scheme that generates a point 
hierarchy closely corresponding to the criterion structure when it is used to evaluate the bench- 
mark jobs. In small organizations, this is typically a trial-and-error process. When a large 
number of jobs is involved, beta weights can facilitate the process because they indicate factors' 
relative contributions to variance in the criterion structure. However, beta weights are not 
readily transformed into an ideal weighting scheme, particularly if a high degree of multicol- 
linearity exists among compensable factors. The procedure used here was to proportionally 
reduce the beta weights to create a 100-point distribution, retaining a minimum weight of one 
for factors. This process established the relative importance of factors. The final set of numerical 
weights was determined experimentally by testing variations of the basic weighting scheme for 
their affect on R?. The usual test for a weighting scheme is its hit rate (Gomez-Mejia, Page, & 
Tornow, 1982), but the multiple classification structures in this job sample precluded this 
approach. 
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came from the beta weights that resulted. In both cases, the point values 
allocated to factors and scale levels reflected the weighting schemes. 

Method 4. This scoring procedure differs from method 3 only in the 
criterion structure used to derive factor weights. Pay rates of classes domi- 
nated by men were used as the criterion structure to test for the problem 
noted in the description of method 3. Thus, this method determined the 
extent to which a different and possibly biased set of benchmark jobs affects 
job evaluation results. 

Method 5. As noted above, two different PAQ equations were developed. 
In method 5, this study used overall PAQ dimensions to capture maximum 
pay for the the total job sample. Because classification rates were heavily 
based on pay survey data, this criterion structure reflects market value. To 
the degree jobs held by women were undervalued in the marketplace, this 
equation was not gender-blind. However, the large and disparate job sample 
provides a possible rationale for defining this as a fair criterion. More impor- 
tant for the present study, the large sample equation roughly approximates 
the approach advocated by PAQ proponents for use of this instrument in job 
evaluation applications (McCormick, Mecham, & Jeanneret, 1977; Mecham, 
1983). Overall dimension scores were selected because (1) they are a more 
concise and less redundant summary of PAQ information than the division 
dimensions; (2) the two sets of scores were equivalent in terms of the propor- 
tions of variance in pay rates explained; and (3) overall interrater dimension 
reliability estimates were slightly better. The mean reliability coefficient for 
the overall dimensions was .83, slightly higher than that of the traditional 
plan factor scores (.80).9 

Method 6. This scoring procedure differed from method 5 only in that 
classes dominated by men provided the criterion structure used to derive 
dimension coefficients. Results from this method allowed for comparisons 
between methods with the same benchmark structures but different factor 
contents and treatments of observed scores. 

In summary, the job evaluation scoring methods (1) were based on two 
different job rating procedures, a traditional plan and the PAQ; (2) represent 
two basic approaches to compensable factor weighting, judgmental and 
statistical; and (3) reflect the effects of using different criterion structures in 
the statistical weighting procedure. 


Analysis 


Convergence among methods of job evaluation was analyzed in two 
ways. First, we calculated intermethod correlation coefficients to provide a 
basis for comparisons with earlier studies. Second, job worth scores from 
methods 2 through 6 were transformed to the metric of method 1 (Nunnally, 
1967:107) to allow direct comparisons between job scores of different 
methods. Method 1 had a 500-point theoretical maximum with actual scores 
ranging from 131 to 418, a range of 285 points. We calculated differences in 


§ These reliabilities compare favorably with those of previous job evaluation studies. 
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job scores between pairs of methods for all 206 classes and all possible 
method pairs. Means and standard deviations of the distributions of absolute 
difference scores were then calculated to provide indicators of disagreement 
between evaluation methods. Meaningful interpretation of point differences, 
of course, requires knowledge of the point ranges associated with pay grades. 
For example, in the Midwest Industrial Management Association plans, pay 
grade increments equate to point differences of 6 to 8 percent of the total range 
of scores. That is, a point difference between jobs of greater than 8 percent 
will result in different pay grades. In the present study, we used a 10 percent 
criterion to estimate agreement rates; a point differential between methods of 
less than 10 percent of the total range of job scores constituted agreement. 
This 10 percent or ten-level rule provided a liberal definition of classifica- 
tion agreement; it was apt to overstate the agreement rates because pay struc- 
tures for service groups each contained from 8 to 13 pay grades. 

The effects of method differences on analysis of equity in a pay structure 
were determined by using the alternative job worth measures in two basic 
approaches to discrimination analysis suggested by the NAS report (Treiman 
& Hartman, 1981). First, we estimated the effect of the gender composition of 
job classes on the classification rates by regressing maximum pay on job 
worth points and a gender variable. A negative, significant gender coefficient 
after job worth is controlled for indicates potential discrimination. Separate 
analyses were conducted for each job evaluation method using both categori- 
cal and continuous measures of gender composition; in the first, the gender 
variable was coded woman-dominated = 1; in the second, the variable was 
the ratio of women to total incumbents. 

Second, the pay rates for job classes dominated by women were com- 
pared with estimates of fair pay for these classes. We developed estimated or 
predicted fair pay levels by (1) regressing maximum pay on scale/dimension 
scores of job classes dominated by men, and (2) applying the resulting coeffi- 
cients to the observed scores of jobs held mainly by women. The rationale for 
this analysis is that fairness dictates that compensable factors for all job 
classes should be equally valued. Hence, when estimated rates for women 
exceed actual rates on the average, there is evidence of possible discrimi- 
nation. Again, we conducted six separate analyses, testing mean differences 
between actual and predicted pay rates for significance with paired-com- 
parison t-tests. : 

Previous research has documented the potential problems with using 
regression procedures to detect or correct pay discrimination (Ramsey, 1979; 
Roberts, 1980). However, such issues are not germane to the present study, 
which focused on the effects of evaluation methods, not on analysis of classi- 
fication discrimination. The procedures described in the previous sections 
were appropriate because they represented current practices in analyzing pay 
equity, and the issue under study was the degree to which the outcomes of 
such analyses vary by evaluation method. 


94 Academy of Management Journal ] March 


RESULTS 


Table 3 presents intercorrelations among job evaluation scores generated 
by the six methods. This table also shows the simple correlations between 
job scores and maximum pay for the 208 classifications as well as the multi- 
ple correlations between maximum pay and sets of factor and dimension 
scores. Results indicate considerable disparity in outcomes of job evaluation 
plans. The low correlation between methods 4 and 6 (.41) is particularly 
notable because (1) the same criterion structure was the source of factor 
weights in both plans, and (2) the two methods explained the same percentage 
of criterion variance. The extent of disagreement between these two plans 
reveals the potential effects of evaluating jobs with different sets of job worth 
criteria and scoring procedures. It is also worth noting that multiple corre- 
lations between maximum pay and PAQ dimension scores are no higher than 
correlations between maximum pay and the traditional plans' scores. In this 
job sample the detailed level of analysis required by the PAQ yielded no ad- 
vantage in explaining variance in classifications. 

Table 4 provides a basis for evaluating differences in methods in terms 
of their potential effects on classification decisions. The table shows mean 
differences in job worth scores and estimated agreement rates for each pair of 
methods. Differences between methods were substantial. For example, the 
two basic procedures (1 and 5) differed markedly, with probable agreement 
, on only 54 percent of the jobs. The differential in job worth scores between 
methods 1 and 5 was less than 10 percent of the range of job scores for 54 
percent of the jobs.? If the agreement criterion had been relaxed to 15 percent, 
divergence in classification decisions would still have been 28 percent. 
Moreover, the size of many of the standard deviation values suggests that 
classification differences often would be greater than one level. 


TABLE 3 
Intercorrelations among Alternative Job Evaluation Scores 
and Classifications' Pay Ratesa 


Job Evaluation Methods 1 2 3 4 5 6 


1. Traditional .92 .90 .93 .83 .64 
2. Simplified .95 .86 .83 63 
3. Traditional — mixed .84 .84 .63 
4. Traditional — man-dominated .71 41 
5. PAQ — total .82 
6. PAQ — man-dominated 

Maximum pay -80 81 81 .68 .86 71 


Rb .89 .86 ,89 .89 .86 .78 


an = 206 job classes. 
b Multiple correlation, maximum pay for classification with set of factor/dimension scores. 


? The range of job scores was defined as the mean range of the two methods compared. 
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The agreement rates in Table 4 might be compared with the findings of a 
recent comparison of the predictive accuracy of seven job evaluation meth- 
ods (Gomez-Mejia, Page, & Tornow, 1982). That study operationally defined 
accuracy in terms of hit rates, the percentage of classifications made with a 
job evaluation method that were within +1 grade of a job’s actual assigned 
grade. The rates ranged from 49 to 73 percent, slightly higher than the current 
study’s. However, two differences between the studies must be considered 
in drawing comparisons. First, Gomez-Mejia and colleagues looked at only 
one occupational family, management, and based all seven of their scoring 

. procedures on a single set of incumbent responses to a customized 235-item 
questionnaire. These conditions could be expected to increase correspon- 
dence between methods. Second, agreement rates and hit rates are not directly 
comparable. The agreement rates in Table 4 merely indicate the extent to 
which two methods produced similar scores, not whether results fell into 
predetermined ranges of values. 

Method differences also can significantly affect conclusions regarding 
the equitability of a pay structure. Table 5 displays results obtained when we 
used job scores from the six methods in the two approaches to analysis of 
discrimination in classification structures that the NAS report (Treiman & 
Hartman, 1981) suggested. The regression coefficients? indicate gender effects 


TABLE 4 
Differences in Job Scores and Classification Agreement 
between Pairs of Job Evaluation Methodsa 





Job Worth Point Difference 
iv. 2 19.7 14.4 73% 
iv. 3 21.5 17.8 58 
iv. 4 18.0 13.5 78 
1v.5 28.1 21.6 54 
1v.6 41.0 31.2 40 
2v.3 14.2 11.8 84 
2v.4 26.6 17.8 55 
2v.5 27.5 20.7 53 
2v.6 42.0 30.8 34 
3v.4 27.6 20.3 58 
3v.5 27.1 21.3 55 
3v.6 43.3 28.9 32 
4v.5 36.8 28.0 44 
4v.6 53.9 38.2 27 
5v.6 28.2 23.1 54 





an = 206 job classes. 
b Absolute difference. 
* Agreement defined as a differential within 10 percent of the range of job scores. 


ê Regression coefficients were obtained using the dummy and ratio measures described in the 
previous main section. ' 
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after total job worth, as measured by the job evaluation methods, was con- 
trolled for. In all six analyses the coefficients were negative and statistically 
significant. However, the issue of interest here is the variance in discrimina- 
tion indices across plans. 

The regression coefficients for method 6 represent an improbable case— 
use of an auditing tool for analysis of discrimination derived directly froma 
criterion structure limited to jobs held mainly by men. Hence, method 6 
should be disregarded here, except as evidence of what could happen if 
compensable factors do not represent a total job structure. However, substan- 
tial differences remain among the other five methods. For example, the 
monthly salary decrement associated with a 1 percent increase in the num- 
ber of women in a job class is 64 percent greater for method 3 than for 
method 4 (1.92 vs. 1.17). The mean-differences tests show similar method 
effects. Analysis using method 1 suggests that, on the average, women- 
dominated classifications are underpaid by $427; the same analysis using 
method 5 suggests only a $57 discrepancy. In short, analyses of equity in pay 
structures are highly sensitive to differences in job evaluation methods. 


DISCUSSION 


The results reported here indicate that concern about the effect of differ- 
ences in job evaluation methods on decisions pertaining to pay equity is well 
placed. If relative job worth is based solely on job evaluation procedures, 
choice of evaluation method will influence pay structures. Consequently, 
inferences regarding gender-based discrimination drawn from studies of pay 
equity based on job evaluation procedures are also method-dependent. 

The results of this study have significant implications for voluntary and 
mandated attempts to incorporate the comparable worth principle in pay 
policies. As noted earlier, comparable worth initiatives tend to ignore the 
fact that there is no definitive test of the fairness of a pay structure. Equity in 


TABLE 5 
Comparison of Results from Discrimination Analyses 
Using Different Job Evaluation Scores? 


Job Evaluation Methods 

Discrimination Indicator 1 2 3 4 5 6 
Regression coefficients? 

Percent women —1.57 —1.81 —1.92 —1.17 —1.61 —4.68 

Dummy variable> —108.00  —138.00  Á —151.00  —93.00  —104.00 ~-309.00 
Mean differences test : 

Estimated v. actual $427 $320 $137 $303 $57 $322 

ts 23.3 17.5 7.1 15.8 3.0 14.3 


an = 206 job classes. 
b Women - 1. 
* All significant at the .01 level. 
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a pay structure is like beauty—in the eye of the beholder. The NAS report 
(Treiman & Hartman, 1981:96) explicitly recognized this fact in recommenda- 
tions for removing or controlling sources of gender-related bias in the devel- 
opment and administration of evaluation plans. However, the NAS panel 
avoided the basic issue: in the absence of a complete consensus, whose 
values should prevail? Development of job evaluation procedures involves 
discovering and reconciling competing criteria of job value; individual val- 
ues or biases are integral to judgments about the worth of jobs. Modifying 
the development and administration of evaluation plans might reduce bias 
but would not ensure that one gender-blind method is more equitable than 
another. The present study suggests that organizations could generate ranges 
of equitable pay structures and choose methods that best meet their objectives. 
Similarly, courts or regulatory agencies charged with enforcing comparable 
worth policies could face situations requiring choices between competing 
versions of fair pay structures. 

Why do these results so strongly indicate method divergence, when pre- 
vious investigators often concluded that method differences were negligible? 
This apparent conflict is not easy to reconcile, in part because previous studies 
jacked detailed information. However, a number of explanations are possible. 
First, limitations of the present study might account for the extent of dif- 
ferences among methods. We have already noted shortcomings in the data 
collection procedures. Furthermore, some method differences could be due 
to statistical artifacts. The relatively small criterion sample used to develop 
factor and dimension weights in methods 3 and 4 and the multicollinearity 
of the traditional plan could have resulted in weighting schemes that were 
highly sample-specific. However, these shortcomings are not unique to this 
study. Perhaps more important, they are apt to be typical of current practices 
in designing and auditing job evaluation plans. 

Second, job evaluation plans traditionally have been confined to single 
occupational families such as factory, technical, clerical, or managerial work, 
whereas the present study sampled six disparate families. This diversity in 
the job sample could have magnified differences between methods in com- 
pensable factors and weighting schemes. 

Third, the earliest comparisons of job evaluation plans (Lawshe, 1945; 
Lawshe & Maleski, 1946) were between simplified and complete versions of 
standard point plans similar to methods 1 and 2, and applied to a single job 
family. Investigators have often cited the high interplan correlations from 
those studies and have overlooked conflicting evidence from comparisons of 
distinctly different methods (Atchison & French, 1967). The results obtained 
here support the intuitive belief that divergence in results will increase with 
differences in job worth criteria, weighting schemes, and scoring procedures. 
Finally, the focus in the past typically was on the relative efficiency of 
alternative methods for predicting or capturing common criterion structures 
(Robinson, Wahlstrom, & Mecham, 1974). Investigators have interpreted simi- 
larity in prediction as agreement between methods and have not investigated 
effects of method differences on actual decisions about pay levels. 
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Additional research is needed on all aspects of the development, 
implementation, and administration of comparable-worth job evaluation 
plans. However, the most pressing research needs pertain to the basic issue 
of defining value. The premise of the comparable worth movement is the 
belief that it is possible to define and measure broadly applicable and accept- 
able dimensions of job value. Whether such general agreement can prevail 
across diverse occupational groupings, particularly if historical pay relation- 
ships might be disturbed, has yet to be demonstrated. In fact, consensus 
within occupational families might be difficult to achieve. Employees and 
employee subgroups differ in how they perceive relevant job-value criteria; 
comparable worth advocates’ frequent references to “men’s” and “women’s” 
job-value dimensions serve to demonstrate this fact. More needs to be learned 
about the extent to which job-value criteria differ, their key determinants and 
correlates, and whether job evaluation development procedures can modify 
these values. There is also a need to investigate the extent to which indepen- 
dent job evaluation committees from the same organizations and using the 
same basic methods, like point plans, generate different factors and weights. 
Such studies would provide direct and practical tests of how adequate job 
evaluation procedures are for implementing the principle of comparable 
worth in wage determination. 

In summary, this study investigated the consequences of using different 
operational definitions of job worth as a basis for pay-related decisions. We 
developed six different methods of determining job worth and applied them 
to 206 job classes. Results indicated that choice of evaluation method signifi- 
cantly affects decisions on job hierarchies and pay equity. This finding has 
major implications for the current debate over the concept of comparable 
worth. 
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In a seminal study, Ross (1957) found that high seniority employees 
who were discharged but subsequently reinstated by arbitrators were 
significantly better performers than reinstated low seniority employees. 
Only one study since Ross's has replicated this result. Others have 
reported contradictory evidence. This study proposes a new theory to 
explain the relationship between seniority and job performance follow- 
ing reinstatement. By quantitatively combining results across studies, 
we show that senior grievants whom arbitrators exonerated were, just 
as Ross found, hetter performers than junior grievants. However, rein- 
stated senior grievants who were not exonerated were found to be signifi- 
cantly poorer performers than nonexonerated junior grievants. 


Arbitrators often take seniority into consideration as a mitigating factor 
when deciding whether to sustain a discharge for just cause or to reinstate a 
grievant (Elkouri & Elkouri, 1973). Ross (1964: 148—149) argued that cases 
involving senior employees with good work records should receive special 
consideration because their infractions were deviations in behavior that are 
unlikely to be repeated. He based his argument on evidence (Ross, 1957) that 
reinstated employees with long seniority are more likely to be satisfactory 
workers than reinstated workers with short seniority. 

Ross’s research has had widespread influence on arbitration decisions 
(Jennings & Walters, 1976) even though subsequent findings have been mixed. 
Of the studies that have examined the relationship between seniority and 
performance after reinstatement, only one has replicated Ross’s original results 
(McDermott & Newhams, 1971: 531). The bulk of the research on this issue 
has questioned the presumption that the performance of senior workers after 
reinstatement is more satisfactory than that of junior workers (Adams, 1978; 
Jones, 1961; Malinowski, 1981; Rodgers, Helburn, & Hunter, 1983). Jones 
argued that: “Seniority. .. has little bearing upon whether an employee re- 
mains or becomes a satisfactory employee” (1961: 40). On the other hand, 
Adams found that “reinstated employees with greatest seniority tended to 
attract more subsequent discipline and discharges” (1978: 74). 
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The purpose of this study was to reconsider the relationship between 
seniority and performance after reinstatement in the light of this seemingly 
contradictory evidence. First, however, we consider the possibility that Ross 
was correct all along and that the discrepant results found in subsequent 
studies were simply artifacts of sampling errors. 

Obtaining data on the performance of employees reinstated by arbitra- 
tors has traditionally been difficult, resulting in a small number of studies 
with small samples. The number of reinstated workers included in these 
studies ranges from a low of 16 (Rodgers et al., 1983) to a high of 93 (Adams, 
1978). A restriction in range of seniority is also evident. For example, Jones 
(1961) included no observations for grievants having less than 3 or more 
than 10 years seniority. Given small sample sizes and dramatic restrictions 
in range, failure to identify marked patterns between seniority and job perfor- 
mance after reinstatement in these studies would not be surprising. 

Populations studied provide another clue as to why previous studies 
yielded considerably different results. Some of the studies combined exoner- 
ated grievants who had been reinstated with full back pay and allowances, 
including restoration of seniority, with exonerated grievants who forfeited 
partial or full back pay but had nonetheless been reinstated. Other studies 
excluded exonerated grievants from their samples either inadvertently or by 
design. If the relationship between seniority and performance is different 
for exonerated and nonexonerated employees, mingling the two groups could 
have generated the equivocal results that have been reported. Thus, in the 
following section we consider three theoretically plausible models that dis- 
entangle exonerated and nonexonerated grievants. 


THE MODELS 
Model 1 


Model 1 suggests that the performance of exonerated workers is better on 
the average than the performance of nonexonerated workers. And it pre- 
sumes Ross’s conclusion to be valid: after reinstatement, senior workers are 
in fact better performers than junior workers. 

Empirical evidence has shown that the relationship between work expe- 
rience and job performance for the working population in general is positive. 
The average correlation between job experience and work performance across 
425 studies has been shown to be r = +.18 (Hunter & Hunter, 1984). Thus, 
whether a worker has been exonerated or not, on the average, senior workers' 
performance after reinstatement should be better than junior workers' perfor- 
mance after reinstatement. Figure 1a depicts the positive regression slopes 
for exonerated and nonexonerated workers implied by this model. 


Model 2 


Model 2 assumes that grievants with high seniority will perform poorly 
after reinstatement whether or not they were exonerated. Perhaps employ- 
ers always conduct systematic reviews of grievants' work histories before 
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discharging them, regardless of the issues involved or the severity of infrac- 
tions. We would expect employers to require flagrant and repeated demon- 
strations of unsatisfactory performance before discharging any senior em- 
ployee. Even employers who do not care about justice or loyalty might fear 
that unjust discharges of high performing senior workers will chill the mo- 
tivation of workers who remain. , 

Generally speaking, arbitrators are also somewhat disposed to give senior 
workers second chances, regardless of the nature of infractions (Ross, 1964: 
148-149). Thus, because companies and arbitrators apply more lenient stan- 
dards to senior workers than to junior workers, we might expect the job per- 
formance of senior workers to be poorer than that of junior workers after 
reinstatement. The performance of exonerated employees should still be 
better than the performance of nonexonerated workers, however (see Figure 
1b). 


Model 3 


The first two models ignore the nature of infractions. Model 3 contrasts 
two different situations: repeated minor offenses versus one-time major 
offenses. For example, consider repeated tardiness or repeated absenteeism. 
In these instances, it is likely that employers would consider work records 
before deciding to discharge employees. Newly hired individuals are likely 
to be fired regardless of work records, but long-term employees are likely to 
be fired only if they have bad work records. If this is true, there would be a 
negative correlation between seniority and performance among those fired 
for repeated minor infractions. In this case, it is also likely that guilt is not 
the issue; rather, the issue is whether a worker deserves a second chance or 
not. Workers who get second chances are reinstated but not exonerated. 

On the other side might be allegations of theft or heated arguments with 
supervisors. Here guilt would be a primary issue and work record would not 
enter into consideration. Such workers would be reinstated only if exonerated. 
In these cases, accused workers are likely to constitute a random sample in 
regard to performance. Thus, we would predict the usual positive correla- 
tion between experience and performance, 

Model 3 thus predicts that the regression lines for exonerated and 
nonexonerated workers will have opposite slopes: performance will be higher 
for senior workers among those exonerated and performance will be lower 
for senior workers among those who were not exonerated. Figure 1c shows 
these predicted relationships. 

A direct test of model 3 would require data relating numbers, types, and 
severity of infractions. Unfortunately, none of the previous studies reported 
these data. In addition, there is no way to ascertain from the studies how 
carefully employers reviewed employee work histories before rendering dis- 
charge decisions. The present analysis used whether grievants were exoner- 
ated or not as a proxy for the severity and frequency of infractions leading to 
discharges. Model 3 predicts that workers fired for repeated minor infractions 
will rarely be exonerated; it also predicts that workers accused of major 
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infractions will be reinstated only if exonerated. Thus, model 3 assumes a 
high correlation between whether workers were exonerated and whether 
infractions were one-shot major offenses or repeated minor offenses. 

Separate data are not available for exonerated and nonexonerated workers. 
Rather, some previous studies considered only nonexonerated workers and 
other studies included both groups. It is mathematically possible to obtain 
estimates for exonerated workers by using these two sets of study results, 
as the Appendix shows. 


METHODS 


A search of the arbitration literature was conducted in journals specializ- 
ing in labor and industrial relations issues, including Industrial and Labor 
Relations Review, Industrial Relations, Journal of Labor Research, National 
Academy of Arbitrators Proceedings, Arbitration Journal, and the Industrial 
Relations Research Association Proceedings. We located six relevant studies. 

To cumulate results quantitatively, results from each study must be 
converted to a common metric (Hunter, Schmidt, & Jackson, 1982). The data 
found in the located studies were converted to two metrics: (1) the propor- 
tion of satisfactory employees within designated seniority intervals, and 
(2) the correlation coefficient. 

Table 1 summarizes data reported for each study on the job performance 
of 219 grievants who were reinstated by arbitrators. Cumulative estimates 
that include an additional 58 grievants from the Adams (1978) study appear 
at the bottom of Table 1, increasing the total sample size to 277. 

Cumulative estimates for the 219 grievants show that 47 percent of the 
workers with two years or less seniority performed satisfactorily after 
reinstatement, and 53 percent of those with 20 or more years performed 
satisfactorily. When the intervals are collapsed, enabling inclusion of all of 
Adames (1978) results, the trend across seniority categories, which were 
0-5, 6-10, 11-20, and over 20 years, is insignificant (X^ = 4.313, df = 3). 

Since the intervals have different ranges and different weights, these 
comparisons are based on interval estimates of varying degrees of reliability. 
For example, a much larger number of reinstated workers fell within the 
lowest seniority interval than within any other interval. The prediction that 
54 percent of 145 reinstated workers with 0-5 years seniority will perform 
satisfactorily upon reinstatement clearly has considerably less sampling error 
than an estimate that 53 percent of 15 employees with over 20 years seniority 
will have satisfactory performance. 

Correlations are a more useful statistic for considering whether differ- 
ences in estimates across studies are due to sampling error or to true differ- 
ences in study populations. We drew correlations from the studies as follows: 
Rodgers and colleagues (1983) reported a correlation between seniority and 
job performance after reinstatement of r = -.25. Jones (1961) reported senior- 
ity and job performance data for each reinstated grievant. Using these data, 
we correlated seniority with performance. Performance after reinstatement 
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was measured by 1 = above average; 0 = average; -1 = below average. This 
yielded a correlation of r = -.38. Using the data in Table 1 , we approximated 
correlations in the other four studies. Seniority for each worker was esti- 
mated using the mean of the interval midpoint for that worker. Performance 
ratings were successful =1 and unsuccessful =0. We did not include data 
reported in the over-20 interval in the tabulation because the interval mid- 
point could not be determined. 

Table 2 reports the correlations (r’s) that were calculated using these 
procedures. Had the studies reported the raw data with which to calculate 
the correlations, the coefficients could have been slightly different. With as 
many as five intervals, however, the difference between the correlations 
estimated from interval data and the correlations that might have been calcu- 
lated from individual observations is trivial. Although correlations calcu- 
lated from individual observations would still have been preferable, study 
correlations calculated from interval data are close estimates of those cor- 
relations. 

Observed correlations were compared to see whether discrepancies across 
studies were due to true differences in study populations or to sampling 
error. The procedure had five steps. First we calculated a weighted average 
correlation. Second, we calculated the observed weighted variance in the 
correlations. Third, we estimated the proportion of the variance due to sam- 
pling error using the standard formula. Fourth, we subtracted the proportion 
of the variance due to sampling error from the observed variance, providing 
our best estimate of the population variance across studies, with sampling 
error taken into account. Fifth, since our estimate of the population variance 
was not zero and therefore could not be explained by sampling error, we 
tested the moderator hypotheses by grouping the studies into two subsets. 
The first subset included studies with grievants who were reinstated with 
full back pay and grievants who were reinstated with less-than-full back pay— 
a mixed subset of exonerated and nonexonerated individuals. The second 
subset included studies of grievants who were reinstated with partial or no 
back pay—nonexonerated individuals. We then repeated the first four pro- 
cedures for each subset. If the population variance was zero for both sub- 
sets, we could conclude that performance was moderated by inclusion of 
exonerated grievants in a study sample. 


RESULTS USING CORRELATIONS AS THE COMMON METRIC 


First, assuming that the true population correlation is constant across 
studies, our best estimate of the population correlation r was a weighted 
average, where N; is the number of reinstated workers in study i: 


X(N ili i 


£N; 
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As would be expected from the generally inconsistent findings summarized 
in Tables 1 and 2, the average correlation between seniority and perfor- 
mance after reinstatement is quite small. 

Second, we calculated the variance of r across studies. The best estimate 
of the variance across studies was the frequency-weighted average squared 
error, where r = the mean weighted correlation (.03): 


_ SIN; (Œ -p4 
SN; 


2 


er -.0441, 


And o, — .21. Third, a portion of this variance may still be due to sampling 
error Cg”. The magnitude of this error, where k = the number of studies, was 
estimated as follows: 


Fourth, the population variance a,” was obtained by subtracting the 
variance due to sampling error from the observed variance: 


ge = o - ge = .0441 - .0216 = .0225. 


Thus, taking sampling error into account dropped our estimate of the 
standard deviation of the population correlation from op = .21 too, = .15 (the 
square root of .0225). However, this is still a large variation. A formal statisti- 
cal test for determining whether no variation exists (Hunter et al., 1982: 
45-47) showed that the variation across studies was greater than what might 
be dictated by chance alone (x? = 12.24, df = 5). This analysis suggested the 
presence of a moderator variable, as model 3 predicted. 

The reinstatement studies were thus divided into two subsets, those 
with only workers who had not been exonerated and those with mixed 
samples. Three studies (Adams, 1978; Jones, 1961; Rodgers et al., 1983) 
included no employees who had been exonerated by arbitrators (see Table 
2). These studies formed one subset. The three remaining studies (Malinowski, 
1981; McDermott & Newhams, 1971; Ross, 1957), which included 39, 35, 
and 30 percent exonerated employees respectively, formed a second, mixed 
subset. Two study statistics were calculated for each of the subsets—the mean 
correlation and the corrected variance. The first subset consisted of studies 
that included only employees who had not been exonerated (n — 126). The 
second subset included studies that mixed exonerated employees with 
nonexonerated employees (n = 151). The correlation is negative (r = -.18,n = 
126) for the nonexonerated group and positive (r = + .20, n = 151) for the 
mixed group. The difference between the correlations is also significant at 
the .001 level (z — 3.13). 
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TABLE 3 
Analysis of the Subsets 

Subset Statistics Nonexonerated Mixed? 

r -18 T.20 
op, .0079 .0105 
oi, .0223 .0198 
o, -.0144 -.0093 

by 0 ; Ü 


*Subset consisted of studies with both exonerated and nonexonerated employees. 


Analysis of the subset characteristics, shown in Table 3, indicates that 
including exonerated workers in a study sample moderates the relationship 
between seniority and performance after reinstatement. This is illustrated by 
the finding that the weighted correlation differs considerably across the 
subsets and the corrected variance c, is zero in each subset (Hunter et al., 
1982: 48). Studies that included no exonerated employees were more likely to 
find an inverse relationship between seniority and performance. Studies that 
included exonerated employees in their samples were more likely to find a 
positive relationship between seniority and performance. This explains why 
the weighted mean correlation is so small (r = .03) when the six studies are 
combined. We thus have reasonable cause to believe that the regression 
slope for exonerated workers is not parallel to the regression slope for non- 
exonerated workers. This tends to disprove models 1 and 2. 

Testing the model 3 hypotheses requires data for two groups of employees, 
those who are exonerated and those who were not. We have pure data on 
employees who were not exonerated, but data for exonerated workers are 
mixed with data on those who were not. We found the strength of the moder- 
ated relationships by calculating the regression slope for the known non- 
exonerated group and deriving the regression slope for the unknown exoner- 
ated group. As the Appendix reports, the regression slope for the exonerated 
group is b = .040; for the group that was not exonerated, it was b = -.015. Thus, 
these calculations provide support for model 3. 

Figure 2 illustrates regression slopes based on the derived estimates 
between seniority before discharge and job performance following reinstate- 
ment for exonerated workers and for workers who were not exonerated. 
Exonerated workers with 10 years or more seniority almost always perform 
satisfactorily after reinstatement. A steep decline appears for workers who 
were not exonerated, suggesting that less than 20 percent with 30 years 
seniority perform satisfactorily. 


DISCUSSION 


For a topic such as ours, where previous research is scant and empirical 
data quite difficult to obtain, the quantitative cumulation of results can be 
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FIGURE 2 
Predicted Relationship between the Performance of Exonerated 
and Nonexonerated Grievants Following Reinstatement 
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useful. Our analysis has indicated that the apparently contradictory evidence 
in the literature on seniority and job performance following reinstatement 
should not be interpreted as inconclusive. Rather, cumulated results demon- 
strated support for one of three theoretical models. We found a positive 
relationship between seniority and job performance following reinstatement 
for grievants who had been exonerated and a negative relationship for griev- 
ants who had not been exonerated by arbitrators. 

Our estimate of the correlation between seniority and job performance 
following reinstatement for exonerated workers of r = + .20 is remarkably 
similar to Hunter and Hunter’s (1984) true population estimate of the cor- 
relation between work experience and performance. The results thus vindi- 


cate Ross’s (1957) original conclusion, but only for workers who were ex- 
onerated. 
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Model 3 implies that the critical factor is not seniority per se; it is 
probably the type of infraction that triggers a discharge. When one time 
major infractions are involved, companies are unlikely to review employee 
work records. We speculate that with multiple infractions companies are 
likely to review work histories and, as a result, to take seniority into consider- 
ation as a mitigating factor. This results in the discharge of senior workers 
with unusually poor work histories. 

Because of data limitations, our test of model 3 has been indirect. We 
could extract no data from the studies regarding the nature or the frequency 
of previous infractions. We hope that future studies will take these factors 
into consideration. 

The practical implication of our analysis is apparent. In certain instances— 
such as when multiple infractions trigger a discharge—we suspect that a 
review of a grievant’s work history is part of the discharge process. Under 
such circumstances, arbitrators should not give special consideration to griev- 
ants who have high seniority. Contrary to conventional wisdom, our results 
indicate that their performance after reinstatement will be less satisfactory 
than the performance of junior grievants. 
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APPENDIX 
Derivation of the Regression Slopes 


The slope of the regression coefficient for the nonexonerated subset is calculated as follows: 
g 


= perf 
bnonexon 7 T X C ) , 
sen 





.50 
= -18 x [2] = : 
bnonexon (23) 015 


The slope of the regression for the exonerated subset must be derived. The mean covariances 
and the mean correlations for the exonerated group and the nonexonerated group affect the 
regression line derived from the mixed study data. The variance for seniority and performance 
in the exonerated group is derived from the fact that the variance of the mixed group is equa! to 
the variance of the means plus the mean of the within group variances. Characteristics of the 
exonerated group are not known, but may be derived from the fact that the mean (X) of the mixed 
group is equal to the weighted mean Daun) for the exonerated group plus the weighted mean 
(Xnonexon) for the nonexonerated group. In the mixed studies exonerated workers constituted 
34 percent of the population, such that 


Xmixed = -34 Xexon + -86 Xponexon . 


Calculation of mean seniority for the exonerated group Xexon is thus: 


= 7.19 — .66 X 6.93 _ 
Xexon OT “Sage 35 = 7.70 DH 


Similarly, mean performance for the exonerated group Prou is: 


p 54-66 x.50 


Po = 91. 


.34 


For the mixed group, 


2 


= 2 2 D H 
o means + average (07 within) - 


Taking the observed mean on seniority for the nonexonerated from Table 2 (6.93) and the 
derived mean on seniority for the exonerated (7.70), the Smeans is! 


.66 X (6.93 — 7.19)? + .34 X (7.70 — 719) = 33. 
Further, where o" on seniority for the nonexonerated = 6.02? = 36.24 and the o? on sen- 
iority for the mixed group = 6.48? = 41.99, we can solve for the unknown V, the g? for the ex- 


onerated as follows: 


41.99 


Wt 


.13 + [.66 x (36.24) + .34V] , 


V = 52.77, 


114 Academy of Management Journal March 


and thus o = 7.26. The calculation of the standard deviation of performance for the exonerated 
group is: 


U^ exon = .91 x .09, 


Sexon ^ -29- 


The correlation between seniority and performance for the exonerated group is readily obtained 
from the fact that the covariance of the mixed subset is equal to the covariance of the means 
plus the mean covariances. 

The covariance of the nonexonerated is ron exor X perf X Tsen OF 6.02 X .50 X —.18 = 
—.542; the covariance of the mixed group is 6.48 X .48 X . 20 = = ,622. And, the covariance of the 
means is equal to: .66 X [(7.19 — 6.93) x (.64 — :50)] + .34 x [(7.19 — 7.70) x (.91 — .64)] = .07. 
Solving for the unknown covariance (C) of the exonerated group, 


622 = .07 + [(.66 X —.542) + .34 C], 


C 


2.87, 


2.67 
7.26 x .29 


For the exonerated group, our best estimate is thus that r = 1.00. Therefore, with r = 1.00, 
a o for performance of .29, and o for seniority of 7.26, the regression slope b is: 


.28 


b-1.00x = = .040. 
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EFFECTS OF GENDER ON 
SELF- AND SUPERVISORY RATINGS 


LYNN McFARLANE SHORE 
GEORGE C. THORNTON III 
Colorado State University 


This research investigated the effects of supervisors’ and subordinates’ 
genders on self- and supervisory ratings in an organizational setting. 
Participants were assemblers, 35 men and 35 women, and their super- 
visors, 16 men and 19 women. Results showed that subordinates’ self- 
ratings were higher than their supervisors’ ratings of them and that 
gender did not affect the relationship between self- and supervisory 
ratings. In prior research involving novel situations in laboratory 
settings, women provided lower self-ratings and higher ratings of others 
than men. The present study showed no gender differences in ratings on 
familiar tasks in a real work setting in which performance feedback 
was available. 


Concern for minimizing error in ratings has produced many and varied 
approaches to investigating accuracy in performance appraisals (Landy & 
Farr, 1980). The present study synthesized two distinct research approaches 
to accuracy and fairness in performance evaluations., First, much research 
has compared self- and supervisory ratings in work settings in an effort to 
evaluate the accuracy of self-assessments for organizational purposes 
(Thornton, 1980). Second, concern about fair evaluations of men and women 
has led to extensive laboratory research on gender effects in evaluation situa- 
tions (Nieva & Gutek, 1980). The present study integrated these two areas of 
research to address issues of accuracy raised by both literatures. 

"Theory and selected research results suggest that previous laboratory 
findings about gender differences in ratings may have limited generalizability 
to the workplace. Therefore, the present study investigated gender differ- 
ences in a realistic setting. Men and women employees and supervisors in an 
actual work organization participated in the study. 


REVIEW OF LITERATURE AND HYPOTHESES 
Congruence between Self- and Supervisory Ratings 
Many researchers have suggested that self-ratings serve a variety of organi- 


zational purposes (Meyer, 1977; Primoff, 1980). The use of self-ratings has 
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been found to decrease subordinate defensiveness in performance appraisal 
interviews (Bassett & Meyer, 1968) and to increase accuracy in evaluations 
(Bartlett, 1959). Self-ratings also serve as criteria against which to validate 
other assessment procedures, thereby helping to make criterion-related valid- 
ity studies feasible in organizations (Levine, Flory, & Ash, 1977). However, 
concern about the use of self-ratings in organizations has arisen because 
extensive research has shown that self-ratings tend to be more lenient than 
supervisory ratings (Thornton, 1980). Leniency of self-ratings relative to super- 
visory ratings has been found for clerical workers (Parker, Taylor, Barrett, & 
Martens, 1959), technical employees (Kirchner, 1966), nurses (Klimoski & 
London, 1974), first-level supervisors (Waldman & Thornton, 1979), and 
executives (Holzbach, 1978; Thornton, 1968). 

Although most research shows that self-ratings are more lenient than 
supervisory ratings, there have been exceptions (Heneman, 1974; Waldman, 
1979). In fact, several variables appear to influence leniency in self-ratings. 
These include tenure (Brief, Aldag, & VanSell, 1977), age (Lawler, 1967), and 
self-esteem (Baird, 1977). One variable that comparisons of self- and supervi- 
sory ratings have not investigated is gender, even though extensive research 
has shown gender effects in evaluation situations (Nieva & Gutek, 1980). 
Thus, the purpose of the present study was to evaluate the effects of super- 
visors' and subordinates' genders on the congruence of self- and supervisory 
ratings. 


Gender of Subordinates 


Waldman (1979) suggested that the gender of subordinates may affect the 
leniency of self-ratings. Despite the well-documented importance of gender 
as a moderator in many situations (Maccoby & Jacklin, 1974), most stud- 
ies comparing self- and supervisory ratings do not state whether the subjects 
were men or women. However, Dipboye and Flanagan (1979) reviewed all 
the empirical articles from the 1966, 1970, and 1974 volumes of Journal of 
Applied Psychology, Organizational Behavior and Human Performance, and 
Personnel Psychology, and found that field research in industrial-organiza- 
tional psychology has predominately had men as subjects. Thus, although 
most studies have shown that subordinates tend to rate themselves higher 
than their supervisors rate them, this finding may only apply to men. In fact, 
Waldman (1979) found that self-ratings were lower than supervisory ratings 
when a group of women secretaries rated themselves. Since the results of 
that study came entirely from a sample of women subordinates, it did not 
conclusively determine the importance of subordinate gender in compari- 
sons of self- and supervisory ratings. However, Waldman's study did lend 
support to the notion that self-ratings may be more lenient than supervisory 
ratings for men subordinates but not for women subordinates. 

In a comprehensive review of the literature on gender differences in 
attitudes and behavior, Maccoby and Jacklin (1974) concluded that women 
have less positive self-referent attitudes than men in many types of situations. 
Specifically, women were found to manifest lower performance expectancies, 
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self-evaluations of abilities, and self-evaluations of completed performances. 
Lenney’s (1977) literature review of self-confidence in achievement settings 
for men and women drew conclusions that were consistent with Maccoby 
and Jacklin’s. Lenney theorized that in many situations, especially where feed- 
back is absent or where social cues are salient, men tend to overestimate 
their performance and women tend to underestimate their performance. 
Therefore, certain situational variables, such as feedback, which exist in real 
work settings but not in laboratories may influence self-ratings. A subsequent 
study by Lenney, Browning, and Mitchell (1980) showed that women 
underestimated their performance more than men did when performance cri- 
teria were ambiguous but not when performance criteria were clear. Finally, 
Deaux and Farris (1977) found that men evaluated their performance more 
favorably than did women, despite equivalent objective scores. 

All of the literature cited above on gender differences in self-ratings is 
based on laboratory research. We found only two studies that used actual 
employees in organizational settings. Deaux’s study (1979) of managers was 
consistent with the laboratory research in its finding that men not only 
evaluate their performance more favorably than women, but also rate them- 
selves as having more ability and greater intelligence. In contrast, Snyder 
and Bruning (1979) did not find gender differences in self-rated ability to 
perform a job effectively. Overall, most prior research on gender differences 
in self-evaluations suggests that women rate themselves lower than men do. 
However, since most of this research took place in laboratory settings, this 
conclusion may not generalize to research conducted in work environments 
where situational variables like feedback may play important roles. 

The gender of subordinates may also influence supervisory ratings. 
However, the literature in this area is not consistent. Some studies have 
shown that people evaluate men more highly than women (Fidell, 1970; 
Pheterson, Kiesler, & Goldberg, 1971; Rosen & Jerdee, 1974; Schmitt & Lappin, 
1980); other researchers have shown bias favoring women in evaluations 
(Bigoness, 1976; Hamner, Kim, Baird, & Bigoness, 1974; Jacobson & Effertz, 
1974; Mobley, 1982; Taynor & Deaux, 1973); and still others have shown no 
differences in evaluations of men and women (Frank & Drucker, 1977; Hall & 
Hall, 1976). 

After an extensive review of the literature on gender effects in evaluation, 
Nieva and Gutek (1980) theorized that the inconsistency in the literature 
may be due to the varying amounts of information provided to raters regard- 
ing the performance levels of ratees. Specifically, they suggested that raters 
tend to rely more on stereotypes in making ratings when they have little 
information on the actual performance of an individual. Furthermore, Terborg 
and Ilgen (1975) suggested that stereotypes influence evaluations most when 
evaluators know little about individuals they are rating. This is often the 
case in, for example, hiring decisions. They also suggested that the effect of 
gender role stereotypes diminishes as evaluators obtain more information 
about workers. In order to test these theories, we carried out the present 
study in a situation in which raters would not rely so much on stereotypes. 
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Since the information most supervisors have about their subordinates is 
considerable, especially compared to what an average subject in a laboratory 
study has, we expected that the gender of subordinates would not influence 
supervisory ratings in an actual work setting. 


Gender of Supervisors 


Brief and colleagues (1977) suggested that investigations of variables 
that influence the level of agreement between self- and supervisory ratings 
should consider matches between characteristics of supervisors and subordi- 
nates rather than just characteristics of the subordinates. A great deal of 
literature indicates that raters’ genders-may influence ratings of others just as 
subordinates’ genders may influence self-ratings. 

The majority of studies investigating raters’ genders have shown a trend 
for women to rate others higher than do men. Elmore and LaPointe (1974, 
1975) found no differences between men and women students’ ratings of 
teachers, except that women students rated instructors higher on specifying 
course objectives. Lee and Alvares (1977) did not find differences in men 
and women undergraduates’ ratings of supervisors in a simulated industrial 
setting. However, women described the same supervisors as being higher in 
consideration than did men. London and Poplawski (1976) found that un- 
dergraduate women in performance appraisal and interview simulations gave 
higher ratings than undergraduate men. Hamner and colleagues (1974) also 
found that women gave higher ratings than men in a simulated work sampling 
task. In sum, the literature shows a general trend for women to give higher 
ratings than men. On the basis of these findings, we expected that the gender 
of supervisors may influence comparisons of self- and supervisory ratings. 

Increasing numbers of women supervise men as well as women subordi- 
nates (Terborg, 1977; Wexley & Pulakos, 1982). As yet, very little research 
has examined the consequences of this trend. The extent of gender differ- 
ences in self-ratings and supervisory ratings in work organizations remains 
largely undocumented, despite the important implications of such differences. 
Furthermore, the findings of research on gender differences in self- and 
supervisory ratings are inconsistent and derive mostly from research con- 
ducted in laboratory settings. Again, a field study is clearly called for. 
Although most prior research would indicate that women employees rate 
themselves lower than their supervisors rate them, enough theoretical foun- 
dation exists to suggest that a different set of social dynamics takes place in 
applied settings in which women are working successfully at familiar work 
tasks. These dynamics may lead to a different pattern of supervisory and 
subordinate ratings. In light of the lack of field research and the conflicting 
evidence from previous studies on gender differences in ratings, the present 
study investigated the effects of supervisors’ and subordinates’ genders on 
self- and supervisory performance ratings. 

To summarize, a comprehensive review of the literature suggests that 
both supervisors’ and subordinates’ genders may affect levels of agreement 
between self- and supervisory ratings because: (1) men tend to evaluate their 
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performance more favorably than women rate theirs, indicating potential 
gender differences in self-ratings by subordinates; and (2) women tend to 
rate others more favorably than men rate others, showing potential gender 
differences in supervisory ratings. Therefore, the present study investigated 
the following hypotheses: 

Hypothesis 1: Women subordinates will rate themselves 

lower than their supervisors rate them whereas men subor- 

dinates will rate themselves higher than their supervisors 

rate them. 

Hypothesis 2: Women subordinates will rate themselves 

lower than men subordinates will rate themselves. 

Hypothesis 3: Women supervisors will rate subordinates 

higher than subordinates rate themselves, whereas men 

supervisors will rate subordinates lower than subordinates 

rate themselves. 

Hypothesis 4: Women supervisors will rate subordinates 

higher than men supervisors wil] rate their subordinates. 


METHODS 
Participants and Design 


The participants were assemblers and their supervisors in a large elec- 
tronics manufacturing company in the Rocky Mountain region. The assem- 
blers included 35 men and 35 women; the supervisors included 16 men and 
19 women. The average age of the assemblers was 35, and their average job 
tenure was 2.75 years. Only supervisors with both men and women subordi- 
nates were asked to participate in the study; 98 percent agreed to do so and 
each provided performance appraisal information for one man and one woman 
subordinate. Therefore, supervisors with subordinates of only one gender: 
did not participate in the study. Subordinates were selected at random for 
participation, which was voluntary for both supervisors and subordinates. 

A 2 x 2 X 2 factorial design was used with (1) source of rating, supervisor 
or self; (2) gender of subordinate; and (3) gender of supervisor as factors. The 
dependent measure was the rating of subordinate’s job performance. 


Questionnaire 


The questionnaire had two sections, a cover letter and a performance 
appraisal form. Supervisors’ and subordinates’ questionnaires had slightly 
different wordings since subordinates rated themselves and supervisors rated 
subordinates. 

The purpose of the cover letter was to provide supervisors and subordi- 
nates with general instructions and information. Each participant’s cover 
letter bore a unique identification number that matched a corresponding 
master list of names held only by the researchers at Colorado State University. 
Respondents were told that the purpose of the identification number was to 
make sure both members of a supervisor-subordinate pair had participated. 
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They were also told that researchers from the University would contact them 
if their forms were not received. For supervisors, cover letters also said 
which of their subordinates to evaluate. All participants were informed that 
their appraisals would remain strictly confidential and would not affect the 
regularly scheduled performance planning and review process. 

The performance appraisal form was very similar to the form regularly 
used in the firm for evaluating the performance of assemblers. However, 
some wording changes were necessary for self-ratings because the company 
did not use self-ratings. The performance appraisal form had six dimensions: 
(1} work quality, (2) work quantity, (3) judgment, (4) initiative, (5) teamwork, 
and (6) dependability. Subordinates were rated on a 1- to 5-point scale, 
where 1 indicated performance below the minimum acceptable level and 5 
indicated that performance consistently far exceeded expectations. 


Data Collection 


All subordinates and corresponding supervisors received cover letters 
and performance appraisal forms. Supervisors were instructed to complete 
an appraisal for each of their subordinates participating in the study. Subor- 
dinates were asked to complete self- appraisal forms. Subordinates and super- 
visors were told to put their completed questionnaires into envelopes pro- 
vided them, seal the envelopes, and mail them to the personnel department. 
We contacted those who did not return their questionnaires to urge their 
participation and got back a total of 98 percent of the questionnaires. Upon 
receiving the completed questionnaires, we mailed a letter to each of the 
participants explaining the purpose and results of the study. 


RESULTS 


Correlations among the six performance dimensions on the performance 
appraisal form resulted in values ranging from .29 to .60. Since many of 
these correlations were low, we concluded that it would be inappropriate to 
combine the six dimensions into an overall index of job performance (James, 
1973). Thus, we analyzed the performance dimensions separately for all of 
the hypotheses. 

A split plot factorial design (Kirk, 1982) was used to analyze the results 
because confounding of groups of supervisors and genders of supervisors 
was unavoidable even though all treatments were crossed. Supervisor's gen- 
der was a between-blocks factor, with some supervisors men and others 
women, since supervisor's gender was confounded with differences between 
blocks or groups of supervisors. Subordinate's gender and source of rating 
were within-blocks factors, and thus not confounded, because (1) each 
supervisor rated one man and one woman subordinate, and (2) for all subordi- 
nates, ratings were obtained from both the individuals and their supervisors. 

Table 1 presents mean performance ratings for all conditions. The six 
performance dimensions were conceptually related to each other since they 
were all facets of perceived work performance. Therefore, a multivariate 
analysis of variance was an appropriate first analysis. If the multivariate test 
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was significant, we performed separate univariate analyses of variance tests 
for each performance dimension to determine which dimensions contributed 
to the significant multivariate result. A multivariate analysis of variance 
showed a significant main effect for source (a oi = 4.13, p < .01), indicating a 
general trend of greater leniency in self-ratings (X = 3.92) than in 
supervisory ratings (X = 3.55). Mean self-ratings were higher than mean 
supervisory ratings for the six performance dimensions for all combinations 
of supervisor's gender and subordinate's gender. Univariate analyses of vari- 
ance (Table 2) showed a significant main effect for source for each of the per- 
formance dimensions. 

The interaction between source and gender of subordinate was not signifi- 
cant (multivariate Fg ou = .46), indicating that subordinate's gender did not 
affect leniency of self-ratings relative to supervisory ratings. Because this 
multivariate result was not significant, we conducted no univariate analyses. 
As predicted in Hypothesis 1, men rated themselves higher than their super- 
visors rated them. Contrary to the predictions of Hypothesis 1, women did 
not rate themselves lower than their supervisors rated them. In fact, women 
tended to rate themselves higher than their supervisors rated them. The 
nonsignificant interaction for source X gender of subordinate also shows that 
subordinate's gender did not affect self-ratings, as predicted in Hypothesis 2. 
Moreover, men and women subordinates rated themselves similarly for all of 
the performance dimensions. 

The interaction between source and supervisor's gender was not signifi- 
cant (multivariate Fs o; = .23). This indicates that supervisor's gender did not, 
as predicted in Hypothesis 3, affect leniency of self-ratings relative to super- 
visory ratings. Both men and women supervisors rated their subordinates 
lower than subordinates rated themselves. Further, the nonsignificant inter- 
action between source and supervisor's gender indicates that there were no 
significant differences in the ratings made by men and women supervisors, 
contrary to predictions in Hypothesis 4. 

An exploratory area of research was the potential three-way interaction 
for source X supervisor's gender X subordinate's gender. This interaction was 


TABLE 2 
Comparison of Mean Differences between Self- and Supervisory Ratings? 
Self-Ratings Supervisory Ratings 

Performance Dimensions Means s. d. Means s. d. F 
Work quality 3.93 77 3.67 65 7.40* 
Work quantity 3.67 .65 3.44 79 7.98* 
Judgment 3.89 73 3.36 82 19.19** 
Initiative 3.94 76 3.54 .76 15.24** 
Teamwork 4.00 74 3.67 76 8.61* 
Dependability 4.03 .76 3.61 92 15.08** 

DN = 70 

*p<.01 


**p«.001 
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not significant (multivariate Fe,sı = 1.26), indicating that there were no differ- 
ences in self- or supervisory ratings between same-gender and cross-gender 
supervisor/subordinate dyads. 

The focus of the present study was the level of agreement between self- 
and supervisory ratings, which we tested by the effect called source. The 
main effects for gender of supervisor and gender of subordinate were not 
meaningful since they combined self- and supervisory ratings. For example, 
for the main effect for gender of supervisor, we combined ratings done by 
men supervisors with their subordinates’ self-ratings to make one group and 
then compared these with a second group consisting of women supervisors’ 
ratings combined with their subordinates’ self-ratings. The same problem 
existed for the main effect gender of subordinate. In any event, these effects 
were not found to be significant. 


DISCUSSION 


Consistent with previous research (Thornton, 1980), the significant main 
effect for source indicates that subordinates tended to rate themselves higher 
than their supervisors rated them. The effect was consistent for both men 
and. women subordinates regardless of supervisor’s gender. This suggests 
that none of the variables investigated in this study influence leniency of 
subordinate ratings relative to supervisory ratings. 

The lack of gender differences in both self- and supervisory ratings in 
the present study is not consistent with most previous research (Lenney, 
1977; Maccoby & Jacklin, 1974; Nieva & Gutek, 1980). However, most previ- 
ous research took place in laboratory settings whereas the present study had 
an industrial setting. Perhaps situational factors such as prior feedback that 
are present in applied settings but not in laboratories account for this 
discrepancy. The literature suggests two potential explanations of why the 
present study of working employees and supervisors might have produced 
ratings that were inconsistent with previous laboratory research: (1) gender 
differences in self-confidence, as expressed in performance ratings, occur in 
novel situations; and (2) stereotypes influence self- and supervisory ratings 
in situations for which information on performance is lacking. 

Maccoby and Jacklin (1974) suggested that women are lower in self- 
confidence than men in many achievement situations. Their conclusion is 
based on the consistently lower performance expectancies and self-evalua- 
tions of both abilities and completed performance exhibited by women than 
by men. They thus assumed that lower self-ratings reflected lower self-con- 
fidence. However, Lenney (1977) suggested that gender differences in self- 
confidence are not likely to occur when individuals obtain clear information 
on their abilities to do specific tasks. A subsequent study by Lenney, Brown- 
ing, and Mitchell (1980) concluded that women state lower expectancies 
than men when estimating their future perfermance on unfamiliar tasks, but 
do not do so for tasks on which they have received prior performance feed- 
back. Thus, the effects of prior feedback on men’s and women’s self-confi- 
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dence may explain the inconsistency of our findings on self-ratings with the 
findings of previous laboratory research. 

The lack of gender differences we found may also be a result of women’s 
low self-confidence on unfamiliar tasks. Prior research showing gender dif- 
ferences in ratings of others used undergraduates as subjects in simulated 
work settings (Hamner et al., 1974; London & Poplawski, 1976). The rating of 
others was a novel task for the undergraduates in those studies. Perhaps the 
higher ratings of others made by women compared to men reflected women’s 
low self-confidence in their abilities to rate effectively in novel situations. In 
contrast, all of the supervisors participating in the present study had prior 
experience in performance rating. Both men and women supervisors were 
thus equally likely to feel confident in their abilities to rate others. 

Gender roles stereotypes may have an important influence on perfor- 
mance ratings in some settings (Nieva & Gutek, 1980; Terborg & Ilgen, 1975). 
Terborg and Ilgen suggested that the effect of stereotypes is most potent in 
situations in which actual performance information is lacking and that stere- 
otyping decreases as information is obtained. People use stereotypes to infer 
information about performance in the absence of better information. Lenney 
and her coauthors (1980) suggested that without external feedback, society’s 
stereotype that women are less competent than men across a variety of situa- 
tions is likely to influence self-evaluations. Therefore, women’s low self- 
ratings in laboratory research may be due to their use of stereotypes in 
making ratings in the absence of performance feedback. In contrast, neither 
Snyder and Bruning (1979) nor the present study found gender differences in 
self-ratings; both these studies involved the behavior of men and women in 
actual work settings where feedback on performance was provided. 

Performance feedback may not be the only variable that differentially 
influences men’s and women’s self-evaluations. Two field studies suggest 
variables that may interact with gender to affect self-ratings even when 
people have had prior performance feedback. First, status differences between 
supervisors and subordinates may produce low self-ratings among women. 
Waldman’s (1979) study in the university setting of women secretaries’ self- 
ratings compared with their faculty supervisors’ ratings found that self-ratings 
were lower than supervisory ratings. In the present study, the first-level 
supervisors who participated had previously been assemblers, so we would 
not expect status differences. l 

Second, a job’s gender-appropriateness may interact with gender to affect 
self-ratings. Deaux’s (1979) study showing lower self-evaluations by women 
than men used first-level supervisors as subjects. Supervisory positions have 
been traditionally held by men. The assemblers participating in the present 
study were not in a stereotypically masculine field, and self-ratings of men 
and women were not different. 

Use of stereotypes may also explain the lack of gender differences in 
supervisory ratings in the present study. Society’s common view of women 
as nurturing, warm, and emotionally supportive (O’Leary, 1974) may influ- 
ence women’s ratings of others when they lack performance information. 
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Thus, women may rate others more leniently than men do in laboratory 
experiments or in selection decisions for which performance data are lacking. 
In contrast, actual work settings provide extensive performance information 
for evaluating current employees so that stereotypes should not influence 
supervisory ratings. 

An important finding of the present study was the lack of a three-way 
interaction between supervisor’s gender, subordinate’s gender, and source of 
ratings. This is consistent with the results of other field studies that have 
found no interaction between gender of supervisor and gender of subordinate. 
Two studies showed no such interactions for supervisory ratings of subor- 
dinate’s job performance (Mobley, 1982; Peters, O’Connor, Weekley, Pooyan, 
Frank, & Erenkrantz, 1984). A third (Wexley & Pulakos, 1982) showed no 
such interaction for ratings of supervisors by their subordinates or attitudinal 
congruence between supervisors and subordinates. Therefore, it appears that 
interactions between gender of supervisor and gender of subordinate may 
not affect performance ratings of actual employees. 

A great deal of the preceding discussion focused on potential reasons 
for the nonsignificant effects of gender on self- and supervisory ratings in the 
present study. Another pertinent issue is the power of the statistical analyses. 
We conducted a power analysis (Cohen, 1977) to determine whether the 
present study had adequate power for detecting significant effects, should 
they exist. 

Power analyses showed that for the main éffect, source, and for the two- 
and three-way interactions between source and gender of supervisor and 
gender of subordinate, power was quite adequate for medium (.50) and large 
(.80) effect sizes. Power ranged from .79 to .995. If the size of an effect were 
medium or large, it is highly likely that the present analyses would have 
detected differences between men and women respondents, and between 
supervisors’ ratings and self-ratings. However, if the effect size were small 
(.20), only the main effect for source of ratings had sufficient power to insure 
the likelihood of detecting group differences (power = .665). For the inter- 
actions, if the effect size were small, power was only .39 for source X gender 
of subordinate, .36 for source X gender of supervisor, and .19 for source x 
gender of subordinate X gender of supervisor. 

In sum, the present study and other studies investigating interactions 
between supervisors' genders and subordinates' genders in real work set- 
tings have a potential limitation. Small effects may not be detectable owing 
to limitations of the populations being studied. Unfortunately, there are 
probably few work situations in which the number of women supervisors 
with men subordinates would allow the detection of small-sized effects. 
Laboratory studies may therefore be detecting small gender differences that 
are not discernable in applied research. Although small gender differences 
should certainly be documented, larger gender differences are of greater 
practical significance to organizations. The present research suggests that if 
gender does influence performance ratings, that effect is in all likelihood 
small. 
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One implication of the present study is that gender differences in self- 
and supervisory ratings may occur only in novel situations. With feedback 
on skill at performing a task, whether it be performance on the task itself or 
rating others’ performance, gender differences in self-ratings and ratings of 
others may not occur. Thus, it is important that future research examine the 
effects of feedback on self- and supervisory ratings. Further, an investigation 
of whether self-confidence or stereotyping are related to gender differences 
in rating would be valuable. 

Another area of future research the current study suggests is an examina- 
tion of situational variables that may affect the relationship between self- 
and supervisory ratings. Personal characteristics such as gender may not 
have the direct relationship with ratings that many laboratory studies seem 
to suggest. Gender differences in ratings may operate only in some work 
situations—for example, when there are status differences, when a job is 
stereotypically masculine or feminine, or in certain occupations. The pres- 
ent study seems to suggest that situational variables such as prior experience 
or amount of previous feedback may interact with gender to affect the leniency 
of self-ratings compared to supervisory ratings. 


CONCLUSIONS 


This study clearly supports the value of testing laboratory findings in 
work settings. Since work environments are much more complex than labora- 
tories it is important to determine whether conclusions based on lab studies 
are applicable in work settings. Of course, every work situation is unique, 
and results from research in one work setting may not apply to others. It is 
especially critical to evaluate the generalizability of studies of perceived 
performance since perceptions are likely to be influenced by aspects of the 
situation such as prior feedback, gender-typing of a job, or organizational 
type. 

There are several important implications of this study for managers. 
First, managers need to be aware that employees tend to evaluate their own 
performances more highly than their supervisors evaluate them. Even when 
organizations do not use self-ratings as part of their appraisal process, employ- 
ees will still evaluate their supervisors’ appraisals relative to their own per- 
ceptions of their job performance. Discrepancies between self- and supervi- 
sory evaluations may well damage employees’ self-esteem. Communication 
between managers and employees regarding perceptions each has of employee 
job performance should help remedy discrepancies. Second, managers need 
to evaluate gender differences in performance ratings for their particular 
setting. Until more research is done assessing the generalizability of the 
present findings across other organizations and jobs, no firm conclusions 
can be made regarding gender differences in performance ratings among 
employees. Finally, managers may be able to minimize gender differences in 
performance ratings by providing employees with accurate and realistic feed- 
back about work-related performance, or by encouraging day-to-day interac- 
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tions between employees and their supervisors so that supervisory evalua- 
tions are based on performance-related information rather than stereotypes. 
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USING THE LENS MODEL TO MAGNIFY RATERS’ 
CONSISTENCY, MATCHING, AND SHARED BIAS 
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Performance ratings have received increased attention in the last decade 
partly because of their importance for many personnel functions, but also 
because the judgments required are interesting cognitive phenomena (Ilgen 
& Feldman, 1983). Differences among the ratings given by supervisors, peers, 
and subordinates are commonly noted (Kavanagh, McKinney, & Wolins, 
1971). These differences could be due to a variety of reasons, including (1) 
differing opportunities to observe behavior and (2) different types of raters 
using different cognitive schemata to evaluate performance. Supervisors, 
peers, and subordinates are likely to have different frameworks for integrat- 
ing information concerning performance, which results in variance in dimen- 
sional and overall performance ratings within and across rating groups. 

Policy-capturing techniques have been used to describe how raters com- 
bine information on individual performance dimensions to arrive at overall 
ratings of performance (Hobson, Mendel, & Gibson, 1981; Zedeck & Kafry, 
1977). Capturing the policies of various groups of raters also allows compari- 
sons concerning how different raters or rating groups use each type of infor- 
mation in making performance ratings. Understanding how people use cer- 
tain types of information can pinpoint more precisely what kinds of factors 
to emphasize when training raters. 

Much of the policy-capturing literature originated with a model of per- 
ceptual process presented by Brunswik (1952). Brunswik was interested in 
how people's perceptions of their world coincided, or failed to coincide, 
with objective reality. He presented his ideas pictorially as a lens model. 
Policy-capturing research generated by this lens model has ignored various 
aspects of the model that may be of potential value in examining interrater 
agreement among raters or rater groups. Specifically, we argue that lack of 
agreement may occur for two reasons. The first is that individuals do not 
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apply any policy consistently. Consistency is the problem usually investi- 
gated by policy-capturing research. Second, lack of agréement may occur 
because individuals fail to apply similar policies. We further argue that 
interrater agreement may be too high when it exceeds values predicted by 
the lens model because of shared rater biases. Finally, scholars have criti- 
cized policy-capturing research for using hypothetical performance profiles 
rather than data collected in actual performance-rating sessions (Hobson & 
Gibson, 1983). 

In our Methods section we describe a methodology based on the lens 
model that allows analysis of (1) the degree to which raters adopt similar 
strategies of combining information, or matching; and (2) the degree to which 
their agreement is a function of job-relevant considerations or a function of 
shared biases. We then provide an empirical example and analyze it using 
these lens model indices. 


METHODS 
Lens Model Conceptualization of Interrater Agreement 


Figure 1 presents a pictorial representation of our adaptation of the lens 
model for two raters. Multiple raters or rater groups would simply involve 
more indices of the same type as appear on either side of the lens model. One 
major assumption underlying the lens model is that raters’ judgments of 
ratees’ job performance reflect the same individual] performance dimensions 
relevant to the overall performance of a job. An additional assumption is that 
a linear model represents all judgments adequately. Individual performance 
is rated on a number of dimensions (Dj. . .D,). In our example, the individ- 
ual values of r, D, represents cartellone between performance ratings on 
major job performance dimensions and an overall rating of job performance, 
where y; corresponds to the overall performance rating and D; equals the 
rating given to the jth dimension. The multiple correlation resulting from the 
regression of a rater’s overall performance ratings on measures of individual 
performance dimensions represents the degree to which information on these 
individual dimensions contributes to assessments of overall job performance. 
This provides an index of consistency. Using the regression equations to 
predict values of overall performance (Y, and Y) and then correlating these 
two sets of predicted values yields a matching index (ry y,). This index, 
largely ignored in policy-capturing research focusing on performance ratings, 
represents the degree to which decision makers use similar strategies when 
they are combining information. 

For the purposes of this study, we adapted Slovic and Lichtenstein’s 
(1971: 657) lens model equation. Our formula for the observed correlation 
between ratings is as follows: 


TYY, 7 TY, Y, By, Ké Ry, Y, g DI 
Equation 1 states that the correlation between two raters’ judgments of a 
ratee's overall performance is a multiplicative function of (1) the degree to 
which the two raters use similar strategies of combining information (ez 
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FIGURE 1 
Lens Model Representation of Interrater Reliability for Two Raters 
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and (2) the degree to which raters use available information consistently in 
coming to overall evaluations of performance (Ry y, and Ry y ). Analyzing 
interrater reliability in this fashion provides information concerning several 
interesting practical questions. First, are decision makers using similar in- 
formation in varying situations and for different purposes? Second, to what 
extent are decision makers using this information in coming to overall judg- 
ments of performance? Third, what explains differences in the decisions of 
different sets of decision makers—lack of consistent information use or failure 
to adopt similar strategies? 

The possibility exists that observed interrater reliabilities will exceed 
those predicted in Equation 1. This could be due to curvilinear use of infor- 
mation when decision makers make overall judgments. Alternatively, it 
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could represent some linear relationship among the residuals caused by the 
availability of similar information to various decision makers rating job 
performance. This would be information not correlated with the dimensions 
that are measured. Assuming that a relatively thorough job analysis pro- 
duces an adequate criterion and a reasonably complete set of performance 
dimensions, we could most likely attribute such a linear relationship to bias. 
It is important to note that bias can occur in the individual dimensional 
ratings as well. Two raters, or two groups of raters, may make both overall 
judgments and dimension judgments with similar biases. Our analysis does 
not deal with that problem, which requires some type of independent dimen- 
sional true score (Borman, 1978). 

One possible source of such dual contamination or bias may be common 
stereotypes associated with ratees’ demographics. In our example, we esti- 
mated this contamination using the following equation: 


ron = Bonge, [-R’y,y,) G-R’y,r JI”, (2) 
where rpaywc equals the incremental intercorrelation that occurs through 
the demographic correlates of the residuals and Rppgs,, is the multiple cor- 


relation between the demographics and the two sets of residuals involved. 

The residual-demographic correlations result in increases in interrater cor- 
relations equal to the product of the multiple correlation between the resid- 
uals and the demographic variables and the residuals themselves. 

Next, we present an example to demonstrate how policy-capturing tech- 
niques can be used with actual performance ratings to illuminate three issues. 
The first is consistency, how raters combine information on individual per- 
formance dimensions to make overall performance ratings. The second we 
call matching, the extent to which different rating groups use similar strate- 
gies in making judgments about levels of overall performance. The third is 
the degree to which common biases may inflate agreement on such overall 
judgments. In particular, we illustrate how we used the Brunswik (1952) 
lens model to analyze three different groups of raters’ judgments of the 
overall performance of a group of school administrators. 


Participants 


Performance ratings were collected for 153 school administrators in five 
different school districts throughout the country. These ratings were used as 
criteria, for research purposes only, in the validation of a recently developed 
assessment center designed to select school administrators. These adminis- 
trators included 91 men and 62 women, 129 of whom were white and 24 of 
whom were nonwhite. 


Measures 


Performance ratings were collected on 15 performance dimensions using 
behaviorally anchored rating scales. Raters were also asked to make overall 
performance ratings. Ratings of administrators’ performance were completed 
by their immediate supervisors, two teachers, and the administrators them- 
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selves. Because of their high intercorrelation, the two teachers’ ratings were 
averaged to provide a composite teacher rating. Table 1 presents short descrip- 
tions of the 15 performance dimensions. The full scales are available upon 
request from the senior author. 


Data Analyses 


The first step in the data analysis involved the regression of the overall 
performance rating on the performance dimensions for each of the rater 
groups. The other correlational indices (ry y, and ry,¢,) suggested by Figure 
1 were computed as well as the correlations between the residuals of the re- 
gression equations and candidates’ races, genders, and positions. 


RESULTS 


Intercorrelations among ratings on the 15 performance dimensions were 
relatively high. The intercorrelations among self-rating dimensions ranged 
from .06 to .64 and averaged .35. The range of intercorrelations for the 
supervisor ratings was .45 to .79 with an average of .65. The average inter- 
correlation among teacher ratings was .69 and the range was .51 to .87. We 
do not present full intercorrelation matrices because of space considerations, 
but they are available from the senior author. 


TABLE 1 : 
Results of Regression Analysis of Overall Job Performance 
on Performance Dimensions for Ratings by Supervisor, Self, and Teachers 


Beta Weights for Prediction of Overall Judgments by: 


Self Supervisor Teacher 
Performance Dimensions (n— 123) (n — 86) (n 79) 
Monitoring curriculum objectives .204* 114 .139 
Monitoring student progress .193* .013 .048 
Supervision of student activities .103 .148 .057 
Participation in student activities -093 —.026 -.039 
Direction of support services —.100 .181* .104 
Directing behavior of students .004 .224* .163 
Staff evaluation .108 —.052 .178* 
Developmental activities —.033 111 .029 
Community relations .135 —.066 .003 
Interpersonal effectiveness —.023 .046 .007 
Relationship with parents 117 .051 .162* 
Coordination with district .305* O11 009 
Fiscal management —.194* —.086 .035 
Maintenance of school plant .060 .063 .064 
Structure communication .120 .317* 170 
Multiple R .79* .88* .93* 
Multiple R? .62 27 .86 
Adjusted R? EH .72 .82 





*p<.05 
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Table 1 presents the results of the regression analysis for the three rater 
groups. All three sets of overall performance ratings were highly predictable 
on the basis of ratings of individual performance dimensions. The three 
vectors of regression weights were compared using the Chow (1960) test. 
These tests indicated that the regression coefficients associated with the 
supervisors' judgments were significantly different from those of the teachers' 
judgments (F = 3.11, df = 16,133, p<.01). Similarly, the regression equations 
for self- and supervisors’ judgments were significantly different (F = 3.30, 
df = 16,177, p<.01) as were the equations for self- and teachers’ judgments 
(F = 6.99, df = 16,170, p<.01). 

Examination of the standardized regression coefficients from Table 1 
indicates that various dimensions best predicted overall ratings by different 
groups of raters. The ratees themselves seemed to most heavily weight efforts 
directed at curriculum, student progress, fiscal matters, and district-school 
coordination problems. Supervisors'overall judgments were most related to 
judgments of ratees' efforts to direct support services and student behavior, 
and to structure communication. Teachers’ ratings of the overall perfor- 
mance of their principals was most heavily influenced by the principals' 
efforts in staff evaluation and relationships with parents. An examination of 
the dimension descriptions and those dimensions given the most weight by 
different rater groups reveals that raters are weighting those dimensions that 
would seem most salient to them. For example, it seems reasonable that 
teachers would be concerned about staff evaluation and harmonious relation- 
ships with parents. However, we make all such statements cautiously, given 
the high degree of multicollinearity in the ratings. 

In Table 2, we present the remainder of the correlations suggested by 
Figure 1. The observed correlations among overall ratings provided by differ- 
ent rater groups are low but about what would seem likely from the research 
literature. As Table 1 reported, the multiple correlations relating overall 
ratings to dimension ratings are uniformly high. However, the matching 
indices are low, which indicates that individuals in these different rater 
groups used different weights in combining information to come to judg- 
ments concerning overall performance of principals. Finally, predicted 
intergroup correlations using Equation 1 are lower than the observed 
correlations. 

As suggested earlier, this discrepancy could occur because different sets 
of judges use information in a similar nonlinear fashion or because different 
sets of judges use information that the various performance rating dimen- 
sions do not include. To uncover a possible source of such dual contamina- 
tion, we correlated five demographic variables with the residuals of those 
ratings. The demographic variables used in this analysis included the races 
and genders of candidates and three pieces of information concerning their 
current employment status: (1) the school level at which they were currently 
working, (2) whether they had been newly promoted from teaching, and (3) 
whether they had been assistant principals. The last two items were coded 
dichotomously. 
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The cross-rater correlates of residuals suggested by Equation 2 provide 
useful information about possible rater bias that is not relevant to measured 
job performance dimensions. The column headed “Equation 2” in Table 2 
presents incremental intercorrelations among rater groups resulting from the 
cross products of the multiple correlations between demographic variables 
and the residuals. We used multiple correlations because of the demo- 
graphics' lack of independence. As can be seen, substantial additional cor- 
relation represents an inflation of interrater correlation because both rating 
groups share similar biases. 


SUMMARY AND CONCLUSIONS 


In this paper, we have presented an interpretation in terms of the lens 
model of the relationship among ratings supplied by different groups of 
raters. We used the model (1) to determine the degree to which overall 
performance ratings were predictable from information available to the raters; 
(2) to examine similarities in the weighting systems used by different groups 
of raters; and (3) to examine potential inflation of interrater reliability through 
bias. 

The analysis provides information that is useful for increasing the effec- 
tiveness of any performance appraisal system. Analysis of equations relating 
overall performance ratings to dimensional ratings provides two types of 
information. First, it reveals what rating dimensions most strongly influence 
overall performance ratings. Second, it shows the extent to which raters' 
overall evaluations of performance are based on dimensions other than those 
involved in appraisal forms. Because an overall performance rating is often 
administratively the most important piece of information in salary and pro- 
motion decisions, it is important that it be unbiased. Two conditions identi- 
fied by use of the lens model may indicate a need for training of raters. If 
regression of the overall performance rating on dimensional ratings results 
in a small multiple R, it may indicate that raters are making overall ratings 
on the basis of other factors, such as gender or race, or that they are inconsis- 
tently using the dimensional information. Examination of matching indices 
as well as standardized regression weights allows for a comparison of differ- 
ent rater policies. Application of a different policy is psychologically differ- 
ent from inconsistent application of a policy and would dictate a different 
rater training strategy. 

Besides identifying possible needs for rater training, the lens model 
analysis provides an objective statistical description of rater behavior. Shar- 
ing information with raters concerning (1) the extent to which their ratings 
are similar to other organizational members' ratings and (2) the importance 
of each performance dimension in determining their overall ratings can help 
them evaluate the extent to which their actual rating behavior reflects their 
intended behavior. Raters may modify their individual decision making pro- 
cesses to fit organizational desires if they receive objective analyses of how 
their rating behavior compares to that of others in their rating group and to 
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organizational policy. However, this hypothesis requires empirical verifica- 
tion. 

Examination of the residual correlations can explain bias. Demonstrating 
that the residuals of equations relating individual performance dimensions 
to overall appraisals of job performance are correlated through common 
demographic correlates could be used to explain or correct inflated inter- 
rater correlations. In this context, we should emphasize that finding com- 
mon demographic correlates of the residuals would not preclude correspond- 
ing differences on both dimension ratings and overall ratings. 

The model outlined in this study could be applied to other problems in 
the human resource area. If performance dimensions parallel those dimen- 
sions on which selection decisions are made, as they should in job-relevant 
systems, organizations could compare decisions concerning selection and 
performance appraisals the way we compared rater groups. Analysis of the 
correlates of the residuals of the selection and performance appraisal equa- 
tions could reveal the source of predictor-related criterion bias. This prob- 
lem has received little attention, though it has obvious implications for the 
fairness of selection decisions. Analysis of job evaluation decisions, job anal- 
ysis data, and assessment center judgments are other potential areas of 
application. 
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The roles of task uncertainty within subunits and interunit interdepend- 
ence in explaining variations in organizational designs have been subject to 
much theorizing (Fry, 1982; Gerwin, 1981; McCann & Galbraith, 1981). Previ- 
ous research has found task uncertainty in subunits to be related (1) to 
personal means of intraunit coordination (Hrebiniak, 1974; Mohr, 1971; 
Pennings, 1975a; Tushman, 1979a; Van de Ven, Delbecq, & Koenig, 1976); (2) 
to boundary-spanning activity (Leifer & Huber, 1977; Tushman, 1977); and 
(3) to such facets of intraunit structuring as participation in decision making . 
(Mohr, 1971) and autonomy in task performance (Schoonhoven, 1981). Perrow 
(1967) identified two components of task uncertainty—number of exceptions . 
and analyzability—that are conceptually distinct but empirically related 
(Withey, Daft, & Cooper, 1983). The present study focuses on the analyzabil- 
ity component of task uncertainty, which we labeled task difficulty. Task 
difficulty refers to knowledge of cause-effect relationships (Van de Ven & 
Delbecq, 1974). Part of this study relates task difficulty to boundary-spanning 
activity, participation, and autonomy. 


The authors wish to thank Louis Fry, Kalburgi Srinivas, and two anonymous reviewers for 
their helpful comments on earlier drafts of this paper. 


140 Academy of Management Journal March 


Little empirical work on the relationship between interdependence and 
coordination mechanisms exists (Cheng, 1983). Only Tushman (1977, 1978) 
studied linkage between interunit interdependence and boundary-spanning 
activity; these studies concerned relationships between laboratory units and 
other units in a high technology firm. The present study also relates interde- 
pendence between units to interunit boundary-spanning activity in the con- 
text of an administrative system; it therefore complements Tushman’s 
research. 

No study has specifically addressed the relationship between boundary- 
spanning activity, one mechanism that coordinates relations betwéen units, 
and elements of intraunit structuring. Research addressing these linkages 
has either combined forms of boundary-spanning activity or has used global 
measures of interaction (Aiken & Hage, 1968; Bacharach & Aiken, 1977; 
Hrebiniak, 1974; Leifer & Huber, 1977; Mohr, 1971). Finally, this study relates 
boundary-spanning activity to participation in decision making and to 
autonomy. 


THEORETICAL ANALYSIS AND HYPOTHESES 


Information processing, defined as “the gathering, interpreting, and syn- 
thesis of information in the context of organizational decision-making" 
(Tushman & Nadler, 1978: 614), has served to explain variations in organiza- 
tion designs. The capacity to process information is a function of a unit's 
design that includes (1) mechanisms linking units and (2) intraunit struc- 
turing. Task difficulty and interunit interdependence are two factors that 
determine requirements for information processing capacity. Because require- 
ments and capacity should match, these factors are linked to the designs of 
work units (Galbraith, 1973, 1977; Tushman & Nadler, 1978). 


Boundary Spanning Activities 


Organizational subunits manage their interunit interdependencies 
through mechanisms that vary in information-processing capacity (McCann 
& Ferry, 1979; Thompson, 1967). Programming, although the least costly 
mechanism, can only deal with predictable events in which cause-effect 
relations are clear (Connolly, 1977; Galbraith, 1972). Supervisors, acting as 
boundary spanners, may deal with interunit decisions not falling within 
procedures. An alternative mechanism, direct lateral contact among subor- 
dinates, may provide less control and be more costly than supervisory bound- 
ary spanning (Galbraith, 1972). Coding or linguistic barriers (Tushman, 1979a) 
may also hinder contact among subordinates. However, if information- 
processing requirements exceed supervisors' capacities, organizations use 
supplementary mechanisms (Thompson, 1967; Tushman & Nadler, 1978; 
Van de Ven et al., 1976). These mechanisms include boundary spanning by 
unit members through gatekeepers, in instances in which differentiation is 
high (Tushman, 1977; Tushman & Scanlan, 1981), and boundary spanning 
through direct contact where such barriers are low (Leifer & Huber, 1977). 
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Such transactions may range from one-on-one meetings to joint problem- 
solving teams (Van de Ven et al., 1976) and may be assisted by third parties 
(Lawrence & Lorsch, 1967). 

McCann and Ferry stated, “Interdependence exists when actions taken 
by one referent system affects [sic] the actions or outcomes of another re- 
ferent system” (1979: 113). In this paper, our concern was the degree of 
policy and procedural interdependence between focal units and other units 
as indicated by changes that other units induced in focal units.! The greater 
the interdependence, the greater would be the need to gather, analyze, and 
distribute information between subunits. Boundary-spanning activity by unit 
members, supplementing the limited capacity of rules and hierarchy 
(Tushman, 1977, 1978), may meet this need for information processing. 
Therefore, we expect that: 

Hypothesis 1: The greater the degree of interunit interde- 
pendence, the greater the amount of boundary-spanning 
activity by unit members. 

It is a central theme in previous research that the greater the task difficulty, 
the greater the requirement for information-processing capacity (Duncan, 
1973; Galbraith, 1977; Tushman & Nadler, 1978). Lateral communications 
that supplement programming and hierarchy in providing advice and infor- 
mation to unit members in task performance may meet such requirements 
(Burns & Stalker, 1961). These networks would provide (1) reduced equivo- 
cality of information (Daft & Macintosh, 1981); (2) conceptual frameworks 
(Connolly, 1977); (3) social validation of decisions (Connolly, 1977; Litwak, 
1961); and (4) coordination (March & Simon, 1958; Thompson, 1967). 

When interunit interdependence is low, these networks may be restricted 
to intraunit transactions, with supervisors managing interfaces between units. 
However, with high levels of interdependence, expanding networks to include 
members from other units may facilitate needed extraunit information 
exchange (Galbraith, 1972). Therefore, since the level of interunit interde- 
pendence for most of the work units studied was moderate or high, we 
propose that: 

Hypothesis 2: The greater the degree of task difficulty, the 
greater the amount of boundary-spanning activity by 
unit members. 


Intraunit Structuring 


Previous research has developed and tested linkages between task diffi- 
culty and participation and has found varying levels of support (Hage & 
Aiken, 1969; Hrebiniak, 1974; Grimes, Klein, & Schull, 1972; Mohr, 1971). 
Briefly, participation by subordinates enhances supervisory capacity to gen- 
erate and evaluate alternatives (Duncan, 1973). The more difficult the tasks, 
the greater the vesting of cause-effect knowledge in subordinates, hence the 


1 For other frameworks, see McCann and Ferry (1979); Pennings (1975b); Thompson (1967); 
and Van de Ven et al. (1976). 
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greater their potential contributions to decision making. Therefore, we hypoth- 
esize that: 

Hypothesis 3: The greater the degree of task difficulty, the 

greater the level of participation in decision making by 

unit members. 

The relationship between task difficulty and discretion at the point of 
task execution enjoys considerable theoretical and empirical support (Bell, 
1967; Burns & Stalker, 1961; Grimes et al., 1972; Hrebiniak, 1974; Schoon- 
hoven, 1981; Thompson & Bates, 1957). When task difficulty is low, subordi- 
nates may refer exceptions upward; but as cause-effect relationships become 
less tangible, units may expand information processing capacity by increas- 
ing discretion at lower levels. Granting authority to subordinates to make 
decisions within prescribed boundaries serves at least three functions: (1) it 
prevents overloading supervisors (Katz & Kahn, 1978); (2) it improves timeli- 
ness of responses; and (3) it places decisions at levels at which there may be 
greater expertise about particular issues than supervisors have. Control of 
subordinates may be exercised through professionalization, the inculcation of 
norms and values, or the specification of goals (Galbraith, 1973). Therefore, 
we expect that: 

Hypothesis 4: The greater the degree of task difficulty, the 
greater the degree of autonomy of unit members. 

Boundary-spanning activities imply both joint decision making with 
members of other units and needs to seek out and consolidate extraunit 
information in performing daily tasks. Thus levels of boundary-spanning 
activity are likely to be associated with the extent of subordinates’ knowl- 
edge regarding other units. Where extraunit knowledge is dispersed, rather 
than concentrated among supervisors, there are potential subordinate roles 
in making decisions having extraunit considerations in at least two situations. 
First, supervisors would be partly dependent on subordinates for informa- 
tion and advice when making decisions with extraunit considerations. 
Although they differ in the form of boundary-spanning activity invesiigated, 
several studies (Aiken & Hage, 1968; Hrebiniak, 1974; Leifer & Huber, 1977; 
Mohr, 1971; Tushman, 1979a) have supported its linkage to participation. 
Second, such joint decision making and subordinate expertise in extraunit 
matters would be consistent with limited supervision in day-to-day decisions 
with extraunit considerations. The work of Aiken and Hage (1968), Bacharach 
and Aiken (1977), and Leifer and Huber (1977) supported this linkage. There- 
fore, we expect that: 

Hypothesis 5: The greater the degree of boundary-spanning 
activity, the greater the level of unit members’ participa- 
tion in decision making. 

Hypothesis 6: The greater the degree of boundary-spanning 
activity, the greater the degree of unit members’ autonomy. 
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METHODS 
Research Setting and Sample 


The senior author carried out this study in the field and head office units 
of the Department of Employment Security of a western state as an unpaid 
researcher. Van de Ven and Delbecq (1974) described this agency's functions 
and structure. Following Van de Ven and colleagues (1976) and Tushman 
(1977, 1978), we used work groups as the units of analysis. First-line super- 
visors with at least one year as unit heads who had full-time subordinates 
with sufficient experience to be familiar with their positions answered ques- 
tions on the subordinates' jobs. The study included all units meeting the 
criterion for supervisory experience and containing at least two experienced 
subordinates. Each potential respondent received an individually addressed 
questionnaire through the internal mail system. Respondents mailed com- 
pleted questionnaires directly to the senior author. The data analysis includes 
only units in which responses came from both the supervisor and from at least 
two subordinates representing at least one-third of those qualified to respond. 
Questionnaires went out to 612 potential respondents in 131 work units; 381 
questionnaires (6296) were returned. Seventy-five units met the criteria for 
data analysis; these units included 360 potential respondents of whom 283 
responded for a 79 percent response rate. 


Variables 


Brief descriptions of each of the variables follow.? All items were mea- 
sured on 7-point Likert scales. Factor analysis with varimax rotation was 
used to group items into variables. The items selected loaded at > .40 on one 
factor. Table 2 gives means, standard deviations, and reliabilities. 

The four task difficulty items followed Perrow's (1967) concept of ana- 
lyzability (e.g., the extent to which individuals use standard methods). The 
two interunit interdependence items concern change in policies and pro- 
cedures induced by other subunits, a concept that applies to both B— A and 
A—> B work flow sequences. An item measuring this concept's complement, 
change induced in other subunits, confounded induced change and inter- 
action; hence the analysis omits it. However, that item's correlation with in- 
terdependence was .58 as compared with an interitem correlation for inter- 
dependence of .64. Thus, we considered the measure to indicate general 
levels of policy and procedural interdependence. The four boundary-span- 
ning activity items measured the frequency and time spent in interactions 
with members of other units. 

Frequency of participation was measured for each of four decisions on a 
scale ranging from almost always to almost never. Two of the decisions were 
task-related and two concerned personnel. A fifth item concerned influence 
in changes in methods initiated by subordinates. Autonomy was a composite 
of two items reflecting independence of action generally and two items refer- 


? Full texts of items and data for all analyses are available from the senior author. 
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ring to independence in handling nonroutine decisions. Two of these specific- 
ally referred to independence from “supervisors or specialists” in making 
decisions. The measure confounded two aspects of autonomy: supervisory 
constraints and constraints placed upon members of focal units by other 
units. However, as Table 2 shows, interunit interdependence was not associ- 
ated with autonomy (r = -.08), implying a bias toward supervisory constraints. 


Pooling 


` To compute scores for work groups, supervisors’ scores were added to the 
means of subordinates’ scores, and these sums were divided by two (Van de 
Ven et al., 1976). Three tests supported this procedure. One-way analysis of 
variance (Dixon & Massey, 1969) supported pooling subordinate responses 
into work groups for all of the variables (p < .005). No variable violated the 
homogeneity of variance assumption.? We dropped four units whose vari- 
ances exceeded the pooled variance (p < .05) for two or more variables from 
the study. 

Because there were scores for both supervisors and subordinates on each 
variable, a multitrait multimethod (MTMM) analysis (Campbell & Fiske, 
1959) was conducted; it supported pooling supervisor and subordinate scores. 
First, the significance of the supervisor-subordinate correlations for each 
variable met the convergent validity criterion (Table 1). Second, if supervi- 
sor and subordinate scores were to be pooled, the pattern of relationships 
amongst the variables had to be similar in both heterotrait, monomethod 
blocks—the supervisor block and the subordinate block. Following Brief and 
Aldag (1978), we used Spearman rank order correlation to test the similarity 
of the two patterns. This yielded a correlation of .72 (p < .05) that supported 
pooling. 


Methods Variance 


Two tests assessed divergent validity. First, all 40 comparisons met the 
criterion that the convergent validity correlations (rij) be greater than ry 
and r;; where i and j denote supervisors’ scores on variables i and j; similarly, 
i’ and j’ denote subordinates’ scores on variables i and j. The second 
criterion, that r; be greater than r;; and ry», was met in 31 of the 40 com- 
parisons. Table 1 shows the nine exceptions. Interdependence and boundary- 
spanning activity show differing patterns with other variables (Table 2), thus 
mitigating concern about common methods variance. However, conclusions 
regarding autonomy bear a caveat on potential methods variance. 


RESULTS AND DISCUSSION 


. Hypotheses 1 and 2 stated that interunit interdependence and task diffi- 
culty would be positively related to. boundary-spanning activity. Results 
supported these hypotheses with p < .001 and p < .01, respectively (Table 2). 


3 Lowest p = .30 with the Bartlett - Box test. 


Ito and Peterson 145 


1986 


100" > d,,, 




















LO" > dys 
o > d, 
692 = N ‘Dg 
I4 = Ne 
LAO Lë0 69°F AuIOUO NY "6 
x8U 640 SOT lee Sup[eur uorsroep ur uopedpned v 
x xO£' xx a BE' 08'0 Sen 6€'€ Ayapoe Surmueds-Arepunog € 
80'— 60° ann LÉI 89'0 SLL £8£ eouepuedopiejut j3runiejug 'z 
mid Mr PH E ZZO 86'0 Pore AYMOYTIP se] "1 
D L4 £ z KS DNKMIKC) Suorjetaa aSUBIW PUN saqe 38A 
paepueis 
vS9[Qe Te A 
Suoure suorje[aX10r) Jop.fO-0J97Z 
S9[q?11e A I[V Soe suonen 13p10-0197 pue sonsnejs aAndrso(q 
c TISV.L 

aver m PR waver = E 

yabe = ER: 
aver = "äi wend = ID +487" ` Aurouomy 'g 
DL . Supeur uoisrep ur uonedtonzeg "v 
Aaf = Ca Aaf = VEI xx «08 Ayano Surüueds-Ampunog ‘g 
vest = (era anata = PCy xxO eouapuedepzeju; jrumzejug 'Z 
PLE AYMOYZIP ai, 'T 
(4 dns Fy (fis Hy CHa) suoneyaaz0p saqe Rea 

SUOHP[23107) ayeurprogns Suorje[ol10;) Xosraadng ajeuirp1oqng-10srA10dng 
WOLD Aprea Waard 0) suondooxq ANPHeA 1uo219Au07) 





eSasA[Teuy Apea juaS1oArq pue juag19AUOr jo sj [nsa 
t WISV.L 


146 Academy of Management Journal March 


These findings from an administrative setting complement Tushman's (1977, 
1978) findings in a high technology setting. However, although Tushman 
found boundary-spanning activity to be concentrated in a few roles, the data 
in the present study did not permit an analysis of its distribution. 

Hypothesis 3, that task difficulty would be positively related to partici- 

pation, received only weak support {r = .21, p < .05), which is consistent with 
Mohr's (1971) and Hrebiniak's (1974) results. When task difficulty was corre- 
‘lated with each of the decisions, only two correlations were significant at 
p < .05; these were number of people and influence in subordinate-initiated 
changes in methods. Task difficulty was not related to two task decisions, 
introduction of new tasks and supervisor-initiated changes in methods. Other 
considerations, such as a need for acceptance of decisions (Vroom & Yetton, 
1973), may overshadow task difficulty in predicting participation (Mohr, 
1971). However, Duncan (1973) showed that linkage between uncertainty 
and participation was contingent upon the nonroutineness of decisions 
themselves, a characteristic this index did not measure. Therefore, this find- 
ing may reflect a flaw in our methods. 

Hypothesis 4, linking task difficulty with autonomy, was supported at 
p < .001, consistent with studies cited earlier in this paper (e.g., Schoonhoven, 
1981). In this study, autonomy was the dependent variable; however, Glisson 
(1978) argued for a linkage from autonomy to task difficulty, suggesting that 
workers given discretion may perceive variations in clients' needs and under- 
take nonroutine searches to solve problems. Nevertheless, in contrast to find- 
ings for participation, these results support Mohr's (1971) position that auton- 
omy and participation be considered as separate variables, rather than be 
subsumed under a rubric such as "organicness" (Leifer & Huber, 1977:237). 
In this light, these variables make different contributions to information- 
processing capacity; therefore they may enjoy different relationships with 
variables that affect information-processing requirements. 

Hypotheses 5 and 6, stating that boundary-spanning activity would be 
positively linked with participation and autonomy, received support at 
p < .001 and p < .01, respectively; these results were consistent with Leifer and 
Huber (1977). One contribution of this paper was isolating interunit 
boundary-spanning activity as affecting two dimensions of subunit structure, 
both of which are aspects of the influence of subordinates in decision making. 
Although our rationale included participation and autonomy as dependent 
variables, the relationships may be reversed or there may be mutual causation. 
Participation in decisions with extraunit implications may require boundary- 
spanning activity during implementation; where autonomy and interunit 
interdependence exist, unit members may be encouraged to seek out extraunit 
information through personal efforts. Longitudinal research to assess causal 
directions would facilitate understanding how changing designs to match 
requirements and capacity for information processing may stimulate shifts, 
or pressures for shifts, in the degree of influence subordinates exercise in de- 
cision making. Thus, such research would have implications for supervisory 
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style. It would also help to reveal how shifts in subordinate influence may 
cause design changes to emerge. 

This study has a number of limitations. First, as the Methods section 
indicated, there was potential common methods variance for autonomy. 
Second, only one organization, a people-processing state agency, was studied, 
and the units of analysis were work groups whose members had no supervi- 
sory responsibilities. Finally, this study used no measures of effectiveness; 
consequently we can draw no conclusions about whether the relationships 
found characterized effective or ineffective units. 


CONCLUSIONS 


Results support all six hypotheses and also support and extend research 
in information processing as an approach to understanding organizational 
design and structure. Two particular themes may contribute to further research 
in this area. The first is the need to specify design elements’ particular contri- 
butions to information-processing capacity and relate these to specific 
demands for capacity that such factors as task difficulty induce. The second 
is the relationship between coordination mechanisms and elements of 
intraunit structure. Research in this area may assist practitioners by helping 
to predict changes in relationships between supervisors and subordinates 
that design changes may induce—or vice versa. 
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SOLUTION DIVERSITY AND THE QUALITY 
OF GROUP DECISIONS 
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Research investigating the effects of diversity on group decision making 
has been extensive. For example, Shaw's (1981) review discussed the effects 
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of diverse needs, personality factors, abilities, genders, races, and other dif- 
ferences among individuals within groups. The presumed benefit of a diverse 
group is that its members’ different points of view lead to a diverse set of 
recommended solutions. Researchers have suggested that a need to reconcile 
diverse solutions stimulates effective group discussion that leads to a high 
quality decision. Studies that measure solution diversity directly and relate 
it to the quality of group decisions are rarer than studies measuring diversity 
in group composition. Yet, investigators have considered solution diversity 
to be important. Examples are Maier’s work on creative group decisions 
(Maier, 1963, 1967, 1970; Hoffman, Harburg, & Maier, 1962} and Janis’s 
(1972, 1982) accounts of groupthink. Janis recommends a number of ways to 
prevent or overcome effects of groupthink; many of these involve increasing 
the diversity of solutions to problems that groups consider. 

The present study continues to examine the link between solution diver- 
sity among a group’s members and the quality of a group’s final decisions. 
There are at least three reasons to continue research in this area. First, despite 
the popularity of both Maier’s and Janis’s advice that group leaders encour- 
age solution diversity, relatively little supporting empirical research has 
emerged. Thus, research is needed that substantiates a direct link between 
solution diversity and the quality of group decision. If such a link can be 
established, systematic research can continue to investigate which interven- 
tions by group leaders (Janis, 1982; Maier & Hoffman, 1960, 1965) best stimu- 
late solution diversity. 

A second reason for conducting the present study was that much of the 
previous research lacked good criterion measures of the quality of group 
decisions. Janis’s work is mostly narrative. Maier’s measure of decision qual- 
ity was an experimenter judgment, usually dichotomous, of group decisions.? 
Such a criterion is flawed to the extent that experimenters know the hypothe- 
sis being tested; moreover, researchers typically consider categorical mea- 
sures less desirable than continuous measures. Furthermore, in Maier’s 
research group, members acted out scripted roles, rather than simply being 
themselves; this, too, could contaminate the criterion. 

A third reason to continue such research is that solution diversity is a 
form of group member diversity that can be more easily manipulated than 
other types. For example, it is generally not practical, and frequently 
impossible, for managers to sort organization members into small groups in 
terms of demographic, need, personality, or ability factors for particular 
problems, and then re-sort them for new problems. Thus, previous research 
on diversity in group composition? is of limited usefulness to most managers. 

However, solution diversity is not a pure individual difference variable, 
as diversity among group members is. Rather, it is the result of interaction 
between people and situations. The amount of solution diversity within a 
given group of people varies with problems, but group composition factors 


1 See Maier (1970) for an entire research program. 
2 See Shaw (1981) for a review. 
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stay constant across varied problem situations. Because solution diversity is 
not a fixed characteristic of groups or their members, it can vary naturally 
across problems. Group leaders can also consciously manipulate solution 
diversity along the lines Maier (1970) and Janis (1982) suggested; these 
include having a devil’s advocate, inviting outsiders, or holding second 
chance meetings. 


METHODS 
Research Setting and Participants 


Data were gathered from 98 ad hoc groups of students (N = 594) who were 
enrolled in an introductory course on management and organizational 
behavior. As part of this course students did several experiential exercises to 
illustrate behavioral science concepts. One of these exercises concerned dif- 
ferences between decisions made by individuals working alone and those 
made by groups. Data gathered in this classroom context do not neatly fall 
into either a laboratory or a field category, so it is important to specify how 
the present study compares to stereotypes of field and experimental lab 
studies. 

The physical setting, a college classroom, was a more natural setting 
than the typical behavioral science lab in which to study group dynamics; 
however, it was a more temporary setting than is typical in work organizations. 
The motivation of the participants in the present study was to learn some- 
thing about group dynamics and themselves. They did not know these data 
would be collected after the exercise and had no reason to see the exercise as 
a situation in which they were subjects for study. This probably reduced the 
artificiality of the experience and also reduced the likelihood of experimen- 
tal demand characteristics being factors here. Gathering data from a learning 
exercise in a classroom also meant the behavior and the setting were consis- 
tent with each other. 

Instructors asked their students to form small groups for the decision 
making exercise.? Students formed natural groups, typically joining those 
sitting in adjacent seats. Thus, groups were not random, as in a typical lab 
study. The process of group formation resembled that seen in friendship 
groups, or in informal groups in organizations like cliques and social groups. 

As a result of the characteristics of the group formation process, groups 
varied in size from three to nine members; two-thirds of the groups had four, 
five, or six members. We removed the effects of group size and of odd or even 
numbers of members by hierarchical regression. Details appear in the Data 
Analysis section. 


3 Differences among the ten instructors could not affect these data because their role during 
the exercise was simply to read the instructions and hand out materials. However, after the 
groups had generated data, the instructors' role became more active. They read the expert 
opinions, and after groups calculated scores they conducted class discussions of the results. 
Therefore, instructor differences only appear after all the data have been generated. 
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Although this research setting was in some ways more natural than that 
of a typical experimental lab study, it was not equivalent to a field setting. 
First, the problem or task to be solved did not arise as part of a job, although 
it had some elements of a real world decision, as the Discussion section 
explains. Second, the groups were ad hoc rather than somewhat permanent 
work groups; they resembled temporary committees. Third, the consequences 
of individuals’ behaviors were much less significant than would be the case 
in true organizational field settings. 


Task 


Two very similar group consensus ranking tasks were used: Subarctic 
Survival (Eady & Lafferty, 1974) and Mountain Survival (Kast & Rosenzweig, 
1976). Each task has individuals assume they are stranded, either in the 
subarctic or on a mountain, and then has them rank order 15 items according 
to their importance for survival. First students ranked the items alone; then 
the groups were formed and instructed to reach group consensus on the 
ranking of the 15 survival items. The task instructions, which were read to 
the students, defined consensus as discussing a problem among themselves 
and trying to reach a group decision about the rank ordering of the 15 items. 
They were instructed not to vote to resolve conflict, nor to average their 
individual solutions to reach a group solution, because voting and averaging 
would defeat the purpose of the exercise. Finally, class instructors announced 
the correct rank orders, based on expert opinions provided by the developers 
of these tasks.* The scoring of individual and group solutions was then 
completed, as described in the next subsection. 


Measures 


The quality of group decisions was determined by comparing groups’ 
rankings of the items to the expert ranking. Specifically, we determined the 
absolute value of the differences between a group’s ranking of the 15 items 
and the expert ranking. The sum of these 15 absolute differences was the 
measure of decision quality. The lower this number, the higher the decision 
quality, because low numbers meant that a group’s rank order closely matched 
that of the experts. 

Five control variables were used to predict the quality of group deci- 
sions before we added the research variable of interest, solution diversity 
among group members. The five were: (1) group size; (2) odd or even number 
of group members; (3) type of group decision problem—subarctic or mountain 
survival; (4) best individual score; and (5) average of individual scores. We 
referred to these five variables as control variables in the hierarchical regres- 


4 The experts for the Subarctic Survival Problem were the Para-rescue Specialists for the 
413th Transport and Reserve Squadron, the Canadian Forces, stationed at Summerside, Prince 
Edward Island. The squadron is responsible for air and sea rescue operations in Quebec, New- 
foundland, Labrador, and the Arctic Regions. The expert opinion for the Mountain Survival 
Problem was provided by David Kneeland, Director of the Institute for Survival Education in 
Seattle, Washington. 
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sion equation, because no experimental design controlled their effects. 
However, their entry into the regression equation prior to solution diversity 
statistically controlled these variables. We discuss our reasons for choosing 
these control variables and for their sequence in the regression in the Data 
Analysis section. 

The variable of major concern, solution diversity among group members, 
was measured with Kendall’s coefficient of concordance, W. This statistic 
assesses the overall degree of agreement in a set of rankings given by several 
individuals (Siegel, 1956: 229—239). The coefficient of concordance is the 
n-case extension of a Spearman rank order correlation coefficient (r,), or 
Kendall’s rank order coefficient (tau). Values of W range from 0 to 1, not —1 
to +1, as in a Pearson correlation, with low values indicating low agreement. 


Data Analysis 


The data were analyzed by hierarchical multiple regression in which the 
control variables were entered first into the regression equation in a specific 
temporal order. The measure of solution diversity among group members, 
Kendall’s W, was entered last to determine how much of the remaining 
predictable variance in the quality of group decisions it accounted for. We 
used an F-test to calculate the statistical significance of changes in R? values 
at each step of the hierarchical ordering of variables. 

The logic for ordering these variables in the regression equation was as 
follows: We entered total group size first because size could affect the level 
of solution diversity; however, the reverse is illogical. Second, whether the 
group had an odd or even number of members was entered before solution 
diversity because it too could affect levels of diversity. We considered this a 
possibility because voting to reduce conflict is more likely with odd num- 
bers of group members. Third, we entered the type of ranking task before 
solution diversity, using a dummy code, because there could be differences 
between the two tasks used here. 

After statistically controlling for group size, odd-even composition, and 
problem type, we entered the two measures of individual solution quality— 
best individual score and the average of individual scores—as the fourth and 
fifth control variables. These were entered before solution diversity because 
they represented the temporal starting points of group discussions. Further- 
more, both measures of individual solution quality also acted as controls for 
two critical individual differences, general IQ and specific expertise, that 
normally would be controlled for by randomization of subjects to groups. 


RESULTS 


Table 1 shows the means, standard deviations, and intercorrelations 
among the variables. The pattern of intercorrelations confirms the necessity 
of entering the five control variables into the regression equation prior to 
entering solution diversity. First, as group size increases, the level of diver- 
sity (W) also increases (r = —.36, p <.01), as had been anticipated. Second, 
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TABLE 1 
Means, Standard Deviations, and Intercorrelations of Variables 

Variables Means s.d. 1 2 3 4 5 8 
1. Group size 6.01 1.51 
2. Odd or even number of 

group members? 0.50 0.50 .06 
3. Problem type” 0.55 0.50 —.58** —.10 
4. Best individual score? — 41.77 9.14 -—.08 .02 —.08 
5. Average of individual 

scores? 57.14 6.39 .22* 41 -—.42** .63** 
6. Solution diversityd 0.56 0.10 —.36* —.11  .30** .08 —.27 
7. Decision quality* 43.56 11.43 .12 10 —.27** .52** .66**  .03 


*Coded 0 = odd number of group members, 1 = even number of group members. 
bCoded 0 = Mountain Survival Task, 1 = Desert Survival Task. 
*Low values mean high scores or high decision quality. 
dLow values mean high diversity. 
*p < .05 
**p « .01 


there was a slight tendency (r — —.11, n. s.) for increasing diversity to be 
found in groups with even numbers of members, as expected. Third, we 
found removal of the variance owing to the type of task to be desirable, since 
the type of task was correlated with both solution diversity (r = .30, p <.01) 
and quality of group decision (r = —.27, p « .01). The Subarctic Survival task 
produced less diversity and was easier than the Mountain Survival task. 
Finally, removing the effects of the best individual score and the average of 
the individual scores was desirable, because both were strongly correlated 
with group decision quality (r = .52, p < .01 for best score, andr = .66, p < .01 
for average score), although neither correlated significantly with solution 
diversity. 

Table 2 shows the results of a hierarchical regression used to determine 
the incremental variance in the quality of group decisions accounted for by 
(1) the control variables and (2) solution diversity among group members. 
Examination of Table 2 indicates that the first variable, group size, did not 
account for a significant change in R?, nor did the odd-even composition of a 
group. However, the type of problem did account for a significant amount of 
variance (696, p <.01) in groups’ decision quality scores. The best individual 
score accounted for 25 percent (p « .01) of the variance in group decision 
quality, and the average individual score accounted for an additional 13 per- 
cent (p « .01). 

The degree of solution diversity among group members also accounted 
for a significant amount of variance (396, p « .05) in group decision quality. 
Besides accounting for a significant portion of variance in the quality of 
group decisions, the effects of solution diversity are also in the direction 
expected. That is, the positive correlation in Table 1 and the positive beta 
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TABLE 2 
Summary of Results of Hierarchical Regression Predicting Decision Quality 





Variables Cumulative H?s R? Changes Betas 
Group size 01 01 046 
Odd or even number of 

group members 02 .01 .047 
Problem type .08 .06** —.033 
Best individual score 33 .25** .116 
Average of individual 

Scores .46 A3** .618** 
Solution diversity A0 .03* .215* 

*p < DR 
**p < DI 


weight (p « .05) in Table 2 show that increasing diversity is associated with 
increasing quality. 


DISCUSSION 


The results of this study provide support for the general advice that 
diversity in groups enhances decision quality. Among those studies that 
have specifically examined solution diversity, the present research demon- 
strates the effects of such diversity more precisely than earlier efforts. Given 
the procedures for data gathering, measurement of key variables, and conser- 
vative analysis method, these data show solution diversity to have a specific 
and identifiable positive effect on decision quality, even after the influence 
of several other important factors has been removed. 

The fact that measures of individual solution quality were much stronger 
predictors of the quality of group decisions than solution diversity does not 
really weaken the conclusion that solution diversity enhances decision 
quality. The relationship between individual and group decision quality 
should be quite high for two obvious reasons. First, they are measures of the 
same type—quality—-and thus share much common method variance. Second, 
doing the same task twice within about one hour's time? almost amounts to 
animmediate test-retest measure, which certainly would inflate a relationship. 

Besides the two methodological reasons discussed above, there are other 
grounds for discounting the practical significance of the two individual solu- 
tion quality scores, even though they account for most of the variance and 
are highly significant. Specifically, what can be said to group leaders other 
than "give the problem to your best subordinate, or if you don't know who 
that would be, make sure your group is of high average competence"? The 
first part of the advice assumes that leaders can identify such individuals; 
but frequently leaders cannot know a priori who will be the most competent. 
In fact, one reason why leaders use groups is because they cannot specify 


5 Groups’ consensus solutions followed individuals’ rank order solutions. 
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such individuals; this is particularly true when problems are unstructured 
(Vroom & Yetton, 1973). The second part of the advice is also a bit glib, since 
it presumes that leaders have enough control over the composition of work 
groups to ensure a high average level of competence. In contrast, managing 
solution diversity among group members appears to be something most man- 
agers can control. 

Other forms of diversity that have been studied, such as needs, per- 
sonalities, abilities, or demographic factors, would be much more difficult to 
affect within a group in any reasonable period of time. The major implica- 
tion for action from studies of diversity among group members has been to 
recommend appropriate composition.and recomposition of groups. Practi- 
cally speaking, this is quite difficult even in the most fluid organizations. 
Hence, Maier’s (1963) recommendation, that leaders encourage groups to 
develop second solutions, or Janis’s (1982) advice, that leaders appoint devil’s 
advocates, invite outsiders to meetings, hold second chance meetings, and 
so forth, have greater relevance for actual managerial practice than recom- 
mendations of group recomposition. 

With all the emphasis in the preceding section on the management of 
solution diversity, it must be remembered that the present research did not 
involve manipulations or interventions. Rather, the degree of naturally occur- 
ring solution diversity that members brought to the group discussion was 
related to decision quality. Specifying that diversity enhances decision qual- 
ity is a necessary first step in research of this type. A subsequent study could 
investigate links between different types of interventions by group leaders, 
such as those mentioned in the preceding paragraph, and degrees of solution 
diversity generated as a result. Whether there might be a difference between 
naturally occurring solution diversity brought to meetings and that resulting 
from a leader's actions during meetings is also a potentially interesting 
question. 

The present study has several limitations. First, the participants in this 
research were not well acquainted with one another, and they formed tempo- 
rary groups solely for the purpose of this study. However, previous research 
found that the beneficial effects of solution diversity are strongest in estab- 
lished groups (Hall & Williams, 1966), because controversy is usually less 
threatening than it is in new groups. Because solution diversity did enhance 
the quality of group decisions in the present study using ad hoc groups, the 
effect of solution diversity found here could be a lower bound. Future research 
might profitably examine the effects of diversity on established versus ad 
hoc groups in order to estimate the upper bound of the effect as well. 

A second limitation concerns the range of solution diversity, because 
extreme values of Kendall's coefficient of concordance did not occur here 
(mean W =.56, s.d. — .10). Whether extremely high or low solution diversity 
would continue to enhance decision quality is also a question for further 
research. 

A third limitation is that solution diversity seems to enhance decision 
quality, but may lower decision acceptance and group member satisfaction 
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(Castore, 1973; Castore & Murnighan, 1978; Shaw, 1981). Since both quality 
and acceptance are components of overall decision-making effectiveness 
(Maier, 1963; Vroom & Yetton, 1973), trade-offs may need to be considered 
between them, particularly when a problem or decision has both high qual- 
ity and high acceptance requirements. 

The final limitation is that we were not able to obtain data about the 
process of group decisions in this study. Thus, there are no direct data, such 
as observations or self reports, showing precisely how solution diversity 
enhanced decision quality. Unsystematic personal observations of group 
process by the researchers and the instructors who helped gather these data 
suggest that diversity may increase decision quality by increasing the amount 
of attention and discussion paid to each group member's individual solution. 
The resulting scrutiny could easily lead to the surfacing of unrecognized and 
incorrect assumptions among group members. For example, consider this 
situation of low diversity: one group member ranks an item in the survival 
task eighth and a second member ranks it tenth. In such a case it will be 
tempting for the members to settle their minor conflict easily. In contrast, 
when one member ranks a particular item first that another ranks as fifteenth, 
there is an extremely high diversity. In this case, the members are much 
more likely to discuss the reasons for their rankings than in the case of low 
diversity. As a result, the assumptions each has made are more likely to 
surface, deepening their understanding of how to cope with the task. 

The final point for discussion concerns external validity. Maier (1963) 
distinguished between solution generation and solution evaluation. It is clear 
that this study concerns the second of these two phases of decision making. 
In this sense, it can complement research on the process of solution 
generation, like brainstorming research. The fact that the present research 
addressed the solution evaluation stage exclusively might be assumed to bea 
major limitation to its external validity. However, this assumption is false. 
Highly structured problems with clear alternatives often arise, and the only 
problem is to evaluate them. 

In general, any situation in which decision alternatives come to a group, 
rather than being generated by a group, resembles the task used in the pres- 
ent research. In such situations solution evaluation is the critical process, 
since it is under group control. An extremely common, analogous situation 
occurs when groups must evaluate job applicants and then rank order them 
so job offers can be extended. Similar examples include groups’ deciding on 
the rank order of employees for promotions as positions open up, or ranking 
who should be sent for advanced training. In all cases the groups making the 
decisions have clearly defined alternatives and must decide on rank orders. 
Human judgment assesses the quality of a group's decision, as in the pres- 
ent case. Since these examples of personnel decisions are common to most 
organizations, our results should have fairly wide applicability. 

These results may have even wider applicability if solution diversity 
enhances decision quality regardless of the particular type of problem. We 
gave a conservative view of external validity earlier in this section, stating 
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that the results generalize only to structured problems concerned mainly 
with solution evaluation, because that situation mirrors the task used to 
generate the data. However, the works of both Maier (1970) and Janis (1982) 
concern tasks and problems with unstructured characteristics; their results 
and recommendations also support the beneficial effects of solution diver- 
sity on the quality of group decisions. 

As a final note, there are four circumstances in which solution diversity 
may be counterproductive. Although none of these characterized the present 
study, they may be boundaries to the beneficial effects of solution diversity. 
First, if diversity leads to interpersonal conflict, overall group effectiveness 
may decline (Evan, 1965). Second, if diversity prevents agreement on basic 
goals for a group, paralysis may result. Third, in crises with severe time 
pressure, diversity may slow a group too much. Fourth, if a decision to be 
made has minimal quality requirements, but high acceptance requirements 
(Maier, 1963), solution diversity may prevent consensus. 
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Data for organizational research are often collected directly from organi- 
zational members, either through paper-and-pencil questionnaires or oral 
interviews. For example, of the 60 articles published in the Academy of 
Management Journal in 1983, 34 were based on data collected by question- 
naire or interview. (See the Appendix.) Although questionnaires and inter- 
views differ in many subtle and important ways, two of the most powerful 
differences concern costs and response rates. Interviews are generally more 
costly in researcher and respondent time than are questionnaires, but they 
yield a higher response rate on a comparatively smaller sample size (Kidder, 
1981: 148— 155). Every organizational researcher faces a cost/response trade- 
off when choosing data collection methods. This paper describes a new tool 
for data collection—electronic mail (EMS)—and suggests that under certain 
conditions it may produce higher response rates at a lower cost than either 
questionnaires or interviews. 


The data reported in this paper were collected while the author was an NIMH Research 
Fellow at Stanford University. A grant from the System Development Foundation supported 
preparation of the manuscript. Jonathan Cender, Martha Feldman, Sara Kiesler, James March, 
and Bob Sproull contributed substantially to the ideas and analysis presented here—the inter- 
esting parts anyway. 
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Electronic mail uses computer text-editing and communications tools to 
provide a high speed message service. Anyone with a computer account can 
use a terminal to compose a message and send it to anyone who has a 
mailbox on that computer or on any other computer that communicates with 
the sending computer. Four important characteristics of EMS are its speed, 
asynchrony, lack of intermediaries, and ephemerality of messages. Messages 
can be transmitted in seconds. They are sent at the convenience of senders 
and read at the convenience of recipients. The scheduling constraints of syn- 
chronous communication vanish. Senders and receivers usually process their 
own mail; they do not submit messages to or receive messages from secretaries 
or operators who process messages for many people. Messages appear on and 
disappear from video screens with no hard copies left behind. Every system 
makes it possible to store messages on computer files and to create hard- 
copies of them, but most messages are never translated into hard-copy form. 
These four characteristics lead to a communication system that is relatively 
fast and convenient to use. Organizations currently using EMS include AT&T, 
Bank of America, Digital Equipment Corporation, Hewlett-Packard, IBM, 
Manufacturer’s-Hanover Trust, Westinghouse, and 3-M. 

As EMS becomes more prevalent in organizations, it may become an 

.appropriate medium for collecting data from organization members. For 
example, a researcher could easily send out questionnaires via electronic 
mail. Recipients would simply edit in their responses to questions and then 
return edited questionnaire messages to the original sender. It would seem 
that researchers could collect data rapidly and conveniently this way. Postal 
mail delays would not occur. Respondents could fill out questionnaires at 
their convenience. 

Three general issues should be considered in exploring the feasibility of 
using EMS as a research tool: (1) respondents' access to EMS; (2) their 

- willingness to respond to surveys via EMS; and (3) the comparability of data 
collected via EMS with those collected by more conventional means. Although 
use of electronic mail is increasing rapidly, it currently reaches and is used 
by only small numbers of organizational members. When EMS users are an 
appropriate research population for particular research projects, accessibil- 
ity is not a problem. As EMS use increases, the issue of access will decrease 
in significance. A second issue is willingness of potential respondents to 
participate in research conducted via EMS. If potential respondents see such 
use of EMS as illegitimate, aversive, or unrewarding, participation rates may 
be low in comparison with those for other media. Respondents may worry 
about possible violations of privacy or lack of anonymity. They may find the 
medium too impersonal for some kinds of questions. Evidence suggests that 
organizational members enjoy using EMS in their jobs and evaluate it quite 
positively (Crawford, 1982; Nyce & Groppa, 1983). This does not mean they 
will necessarily respond enthusiastically to EMS surveys, but it does suggest 
that they will not be discouraged by aversion to the medium itself. The third 
issue is substantive differences in answers that might be produced as a result 
of communicating via EMS versus some other medium. There is a tradition in 
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social science research of trying to understand how the means of collecting 
data affects that which is collected (e. g., Stewart & Cash, 1978; Sudman & 
Bradburn, 1982). If EMS for some reason produces data that differ from those 
generated by other media, its use could not be justified without comparative 
research on the nature of those differences. 

This paper investigates the second and third issues. It describes an organi- 
zational research project carried out entirely by electronic mail. It contrasts 
the procedures and results of this project with (1) those of a comparison 
project conducted in the same organization using conventional data collec- 
tion techniques, and (2) with response rates reported in the literature. 


RESEARCH SAMPLE AND PROCEDURE 


The R&D and product development divisions of a Fortune 500 office 
products and systems manufacturer were the setting for this research project. 
Within each division one unit was selected that represented a well-established 
electronic mail community. This organization has used EMS for 12 years; 
over 80 percent of all employees in the units selected have excellent access 
to EMS terminals. They use the system extensively; the mean message traffic 
per day per person. is 24 messages. All participants in this study were 
professionals. Ten men were selected at random from the professional ranks 
of the research organization; ten men and ten women were selected from the 
professional ranks of the product development organization. Thus, the size 
of the total target sample was 30 professionals from well-established elec- 
tronic mail communities. The comparison project had a comparable sample 
of 30 randomly selected professionals. Respondents in the two samples did 
not differ on age, education, job category, number of years in the organiza- 
tion, number of years using EMS, or access to EMS. Their mean age was 34; 
their mean number of years of schooling was 16; their mean number of years 
in the organization was 4; their mean number of years using EMS was 5;* and 
78 percent had EMS terminals in their offices. 

The general data collection protocol asked participants to save all of 
their electronic messages for three days and to then answer a series of ques- 
tions about some of them. Participants were asked 10 open-ended questions 
about the nature of each message, for example, "Why did you receive (or 
send) this message? What is it about?" They were asked 13 closed-form 
questions about each message partner, for example, "What is this person's 
hierarchical position with respect to you?" with 1 - subordinate, 2 = peer, 
3=superior, 4— not in chain of command. Each participant also filled out an 
80-90 item questionnaire about general EMS attitudes and behaviors. The 
questionnaire included items on personal history, EMS history, evaluation 
of current EMS programs, self-reports of EMS behavior, and questions about 
how respondents would choose to do a series of hypothetical tasks. 


. * Some respondents had used EMS in other organizations in which they had been employed. 
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In the EMS version of this research, all communication was via elec- 
tronic mail. The researcher, who had a mail account on one of the corpora- 
tion’s computers, sent an invitation to participate in the project and an 
endorsement from the division vice-president as an electronic message. The 
invitation message asked participants to send back messages indicating 
interest. When a participant responded, the researcher sent a message ex- 
plaining the data collection procedure. This message told participants to 
save all of their messages for three days in a computer file and then send 
the file to the researcher, who would send back a sample of messages with 
questions to be answered about each. After a participant returned three days’ 
worth of messages, the researchers sent back all messages from the third day, 
or a random sample of 15 if there were more than 15 messages on that day. 
Each went as a separate message from the researcher with this statement 
appended: 


Thank you for sending me your messages. I have chosen arandom 
sample of 15 of them [or, a sample of them] about which to ask 
you some questions. I am sending each of these messages back to 
you with questions appended on the end. Please use the "for. 
ward" command to copy this message into the edit window, then 
edit your answers in to each question, then ''deliver" the result 
back to me. I am also sending, as the 16th message, a brief ques- 
tionnaire; you should also forward it back to me with your an- 
swers edited in. 


Ten open-ended questions about the nature of the message and 13 closed- 
form questions about characteristics of message partners' followed each 
message. The final message sent a questionnaire about general EMS attitudes 
and behaviors that included nine items about the experience of providing 
data electronically. 

The conventional study was designed to produce a very high response 
rate. Its participants received a general letter explaining the nature of the 
research accompanied by an endorsement letter from the division vice- 
president. The interviewer then telephoned participants to answer any ques- 
tions, explain the research procedure, and schedule interviews. They were 
told to save all of their messages for a three-day period and to have hard 
copies available at the time of their interviews. In face-to-face interviews they 
(1) orally answered 10 open-ended questions about all messages from the 
previous work day, or a random-sample of 15 if there were more than 15 
messages on that day; (2) filled out a 13-item closed-form characteristics 
sheet for each message partner; and (3) filled out a written questionnaire. 

In summary, both groups were asked to save copies of their messages 
and make them available to the researcher either in an on-line file or a hard 
copy. Both groups answered the same open-ended questions about a sample 
of their messages, either electronically or orally. Both groups answered the 
same questions about their message partners, either electronically or in 
writing. Both groups filled out a general questionnaire either electronically 
or in writing. Participants in the electronic version also answered nine ques- 
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tions at the end of the questionnaire about the experience of providing data 
electronically. Participants in the conventional version were asked no com- 
parable questions. 


RESULTS 
Participation 


The participation rate in the electronic version of the study was 73 
percent. The participation rate was 87 percent in the conventional study 
(t=.86, p=n.s.). In a follow-up phone call, four out of six nonparticipants in 
the electronic version indicated they would have been willing to participate 

but they had forgotten to respond to the original message because they were 
too busy. Table 1 shows nonparticipation rates for the two studies. 

The 73 percent participation rate is high in comparison with typical 
response rates in hard-copy mail questionnaires, which typically range from 
20 to 50 percent (Heberlein & Baumgartner, 1978). Organizational researchers 
often do better than public opinion researchers, but response rates still are 
low enough that nonresponse bias can be a problem. For example, the mean 
response rate across the 14 mailed questionnaire studies published in the 
Academy of Management Journal in 1983 was 53 percent. (See the Appendix.) 

A mail study conducted at the same time as the EMS study reported in 
this paper offers a direct, although unplanned, comparison of EMS with 
hard-copy mailed questionnaires. Steinfield (1983) conducted a study of 400 
people who worked in the same divisions of the corporation as the partici- 
pants in the study reported here. His study had the same corporate sponsor- 


TABLE 1 
Nonparticipation Rate for Two Different Means 
of Data Collection in a Field Research Project 


Means of Data Collection 
EMS Face-to-Face 


Research Product Development Research Product Development 


Reason for 

Nonparticipation Men Men Women Men Men Women Totals 
Did not respond 2 2 1 0 0 0 5 
Refused 0 1 0 1 0 0 2 
No longer 

employed 1 0 0 0 2 1 4 
No EMS access 1 0 0 0 0 0 1 
Total 

nonparticipants 4 3 1 1 2 1 12 
Total 

participants 6 7 9 9 8 9 48 


N= 10 10 10 10 10 10 60 
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ship as this one and included respondents from the same professional levels. 
The topic of that study was also similar to the topic of this one. He reported a 
response rate of 55 percent using hard-copy mailed questionnaires. 


Duration of Data Collection 


The average elapsed time for data collection was half as long in the 
electronic version as it was in the conventional version. In the face-to-face 
research, it was necessary to wait four days after mailing invitation letters 
before telephoning to schedule interviews to make sure that respondents had 
received their initial letters in the mail. Only about 60 percent of the tele- 
phone calls actually reached the participants on the first day, requiring 
callbacks on subsequent days. Interviews were scheduled a minimum of four 
days after phone calls so that participants could save three days worth of 
messages. The average elapsed time between initial letter and completed 
protocol in the face-to-face research was 12 days; the average elapsed time in 
the electronic mail version was 5.6 days (t=9.3, p « .01). 

The time each respondent actually spent answering questions seemed to 
be about 15 minutes longer in the EMS version than in the face-to-face 
version. (EMS respondents received nine more questions than the face-to- 
face respondents.) The amount of time required for a face-to-face interview 
averaged just under one hour. The average time reported by participants in 
the EMS version for responding to all the messages was 74 minutes {t= 1.38, 
p^n.s.). 


Completeness of Data 


The questionnaire data were slightly more complete in the conventional 
study than in the EMS study, although in both cases they were well within 
acceptable bounds. In the face-to-face version, the interviewer could check 
the questionnaires on the spot to make sure they had been filled out properly 
and completely. This was impossible in the EMS version, hence the missing 
data rate was higher. In the face-to-face version the overall mean missing 
data respondent was .2 percent; in the EMS version it was 1.4 percent (t = 1.16, 
p=n.s.). 


Substantive Comparisons 


There were no differences in the nature of answers provided by EMS 
participants compared to those provided in the conventional study, as 
assessed by t-statistics. Both sets of participants reported the same amounts 
of message traffic. They reported the same EMS writing styles in terms of 
how often they used salutations and closings and fixed typos. They also 
reported the same EMS histories, the same frequencies of work and nonwork 
messages, the same incidence of inappropriate messages, and the same choices 
of medium for a series of hypothetical tasks. 

Most of the questions in this study were factual or objective. We would 
not expect the medium through which they were asked and answered to 
affect the substance of people’s responses. When questions tap subjective 
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states such as emotions or definition of self, however, the medium used may 
be more likely to affect the nature of the answers. The stronger the social 
presence of the researcher, the more likely people are to report positive or 
socially desirable responses (Locander, Sudman, & Bradburn, 1976; Woods 
& McNamara, 1980). Other research has demonstrated that electronic re- 
sponses, in comparison with oral or hard-copy responses, tend to be more 
extreme, less socially desirable, and characterized by more self-disclosure 
(Kiesler, Siegel, & McGuire, 1984; Kiesler & Sproull, 1986; Kiesler, Zubrow, 
Moses, & Geller, 1984). This is because electronic mail provides almost no 
information about the social presence of message partners or the social con- 
text of communications. Without the regulating effects of such social infor- 
mation, responses tend to be more extreme. The most subjective questions in 
this study were those about how people would choose to do 12 hypothetical 
tasks. As reported earlier in this section, there were no differences in mean 
responses to these questions across the two studies. There was, however, a 
difference in extremity of response. Each hypothetical question could be an- 
swered on a scale from 0 to 10; every 0 or 10 response was coded as an ex- 
treme response. The EMS study elicited more extreme responses than did the 
conventional study. Out of 48 hypothetical task questions, the EMS study had 
a larger number of extreme responses in 35 of them (t= 3.24, p < .01). This was 
the only way in which the two studies differed substantively. Extreme re- 
sponses are important substantively, because they can indicate very strong 
opinions. The differences in extremity of response are noteworthy in this 
study because they suggest that there may be an interaction between method 
of data collection and question type for some questions but not others: method 
may affect subjective indicators but not objective or factual ones. 


Attitudes toward Data Collection 


Participants in the electronic mail version varied in their attitudes toward 
providing data via electronic mail, although mean attitudes were fairly neutral. 
Participants in the face-to-face version were not asked a comparable set of 
questions. Table 2 summarizes responses to a series of questions about the 
process of providing data via electronic mail. 

It is possible to compare attitudes toward participating in further research 
using the same media as these studies—either electronic mail or face-to-face 
interviews. Participants in the face-to-face version were asked: “With your 
permission may I contact you again to talk more about electronic mail?" All 
of the face-to-face participants said yes. Participants in the EMS version 
were asked: “Would you be willing to be contacted again via electronic mail 
at some later date to answer more questions about your use of electronic 
mail?" Ninety percent of the EMS participants said yes. The attitudinal data, 
although certainly not conclusive, suggest no serious disadvantage for elec- 
tronic mail in comparison with face-to-face data collection. Table 2 summa- 
rizes the attitudinal data. 
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TABLE 2 
Distribution of Attitudes toward Providing Data via EMS 


Frequencies of Responses 


Attitudes? 1 2 3 4 5 Means 
Convenience 4 9 4 5 0 2.4 
Ease of providing data 3 5 8 6 0 2.8 
How time-consuming? 0 4 8 4 6 3.5 
How interesting? 2 3 11 3 3 3.1 ^ 
Prefer EMS? 6 5 2 3 6 2.9 


SAI attitudes were measured on 5-point Likert scales. Questions were of the form “How con- 
venient did you find this data collection process?" 1- very convenient; 5= very inconvenient. 


DISCUSSION 


In this study, an EMS questionnaire produced (1) a response rate of 73 
percent; (2) an elapsed time for data collection of less than a week; (3) a 
missing data rate of 1 percent; (4) no substantive differences in mean re- 
sponses to questions in comparison to a written questionnaire; (5) fairly 
neutral attitudes toward providing data via EMS; and (6) willingness to 
provide data again via EMS. 

The population of this study included only professional men and women. 
Other kinds of employees—secretaries or managers, for example—should be 
equally able to provide data via EMS, so long as they use EMS naturally in 
their jobs. But of course researchers should investigate this issue empirically. 
The topic of this study was behavior relative to EMS. There is no reason to 
believe that other topics unrelated to EMS, like job characteristics or quality 
of work life, would lead to results different from those in this study. Clinical 
researchers, for example, have used on-line questionnaires to elicit reliable 
data about medical behaviors (Erdman, Klein, & Greist, 1983). But obviously 
different topics require further research. The questions in this study were 
relatively objective ones. Subjective indicators collected via EMS may pro- 
duce responses that differ from those collected via conventional means. This 
issue is the topic of current research (Kiesler & Sproull, 1986). 

Any researchers with access to their respondents’ electronic mail system 
could easily follow the EMS data collection procedure described in this 
paper. The results of this study do suggest some possible modifications, 
however. The behaviors of nonparticipants and cautious participants in the 
EMS version indicate that participation was at least slightly more problem- 
atic for the electronic mail research than it was for the face-to-face research. 
There are two likely reasons for this finding. One is that electronic mail 
suppresses status cues to legitimacy. In the face-to-face version, the initial 
invitation appeared on letterhead stationery from the Stanford University 
Business School; the endorsement letter was written on a corporate memo- 
randum form. Each letter was personally signed. These communications 
may carry more probative information about the status and legitimacy of 
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their senders than do identically worded messages sent electronically. Per- 
haps formal hard-copy letters could precede EMS questionnaires. The sec- 
ond reason has to do with managing attention. In the electronic mail version, 
the research request was undifferentiated from the 20 or so other messages 
each person got during the day. The participants in this research get many 
` fewer first-class letters than that; hence that form of communication is likely 
to command more attention, even apart from status cues. But additionally, in 
the face-to-face version, the researcher took the initiative and tracked down 
participants by telephoning as often as was necessary to actually talk with 
them. In the electronic mail version, each participant had to take the initia- 
tive to respond. Sending out electronic reminders could help here. 

It is also possible to suggest some reasons why the electronic mail ver- 
sion of answering questions takes noticeably longer than the face-to-face 
version. One is simple differences in transcription rates. Most people can 
probably speak? faster than they can type. A second is pacing differences. In 
the face-to-face version the researcher did not want to take any more of the 
respondents’ time than was necessary and therefore kept respondents answer- 
ing rapidly. In the electronic mail version, no external goad kept up the pace. 
A third is differences in developing a shared shorthand. In the face-to-face 
version it was common for respondents to describe a message as “just like 
the previous one," thereby eliminating the need to ask every question about 
the message. Additionally, after a few messages, many respondents learned 
the nature and order of questions and were able to answer them for each new 
message without prompting. The noninteractive nature of the electronic mail 
version prohibited the development of such shared understandings. 

EMS may prove to be a quite useful tool for organizational researchers. 
Two of its important strengths, demonstrated in this paper, are convenience 
and low cost. Transmission time is low in comparison with mailed question- 
naites. Respondents reply at their own convenience, a feature that may out- 
weigh the fact that responding to an electronic questionnaire may take slightly 
longer in elapsed time than would a comparable face-to-face or hard-copy 
questionnaire. Researchers need not be physically present to deliver or ad- 
minister questionnaires. One potential weakness is that there is no social 
presence creating a pressure to respond, so response rates may be lower 
than in face-to-face interviews— but perhaps higher than with hard-copy 
mailed questionnaires. Another weakness is that it is difficult to convey 
pictures or other graphical information with current EMS technology. In the 
study reported in this paper, EMS was clearly the preferred data-collection 
method because it produced adequate data, response rates, and willingness 
for further participation, with little expenditure of researcher time or effort 
and a high degree of convenience for respondents. 


? [n this study, the researcher took summary notes by hand. 
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APPENDIX 


For every article and research note published in the Academy of Management Journal in 
1983 the means of data collection was noted. Out of 60 articles, 34 used some form of question- 
naire or interview. For each of these articles, three additional characteristics were noted: the 
target sample size, the final sample size, and the response rate (final divided by target). In some 
cases, one or more of these pieces of information did not appear but could be inferred from 
Table Ns or calculated from other information. Table A.1 summarizes the characteristics of 
these studies. 


Lee Sproull earned her Ph.D degrze at Stanford University; she is Associate Pro- 
fessor of Social Sciences at Carnegie-Mellon University. Her current research interests 
include organizational information processing and the social aspects of computing. 
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Interest in the specific effec-s of work schedules has increased during 
recent years, focusing both on the traditional five-day/40-hour week (5/40) 
and on alternative work scheduling arrangements such as the four-day/40-hour 
week (4/40), the 4 1/2-day/40-kour week (4.5/40), staggered hours, and 
flextime. Practitioners and researchers have also examined various configura- 
tions of shiftwork schedules and have increased their efforts to document 
systematically the strengths and weaknesses of these schedules. Despite these 
efforts to understand more fully the effects of various schedules on workers 
and their organizations, most of the literature on alternative work schedules 
still focuses primarily on a relatively narrow range of workers' reactions to 
work schedules. Factors that have been studied include job satisfaction, 
stress, internal work motivation, job involvement, and organizational com- 
mitment (Glueck, 1979; Golembiewski & Proehl, 1978; Newstrom & Pierce, 
1979; Pierce & Newstrom, 1982). 

With few exceptions (cf. Allen & Hawes, 1979; Goodale & Aagaard, 1975; 
Mahoney, 1978; Nord & Costigan, 1973), practitioners and researchers have 
directed very little conceptual or empirical effort toward examining em- 
ployees' attitudes regarding specific work schedule alternatives. This, in 
turn, has somewhat limited attempts to understand workers' attitudes toward 
alternative work schedules and how these attitudes may be used to design 
successful work schedule programs. 

A 1975 study (Mahoney, Newman, & Frost) developed a set of scales to 
assess workers' attitudes toward a four-day work week. The results of this 
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study suggested promise for the assessment of such attitudes. Unfortunately, 
Mahoney and his colleagues did not show the items from their scales but 
only described them generally; nor did they provide reliability or validity 
information for the scales. To the best of our knowledge, these scales have 
not received additional development. 

A more thorough study (Golembiewski, Yeager, & Hilles, 1975) devel- 
oped a set of 18 items for assessing attitudes toward current work schedules. 
Although these items did not appear in their entirety, they apparently could 
be used with a wide range of types of current schedules. In fact, Golembiewski 
and coauthors used the set of items for a group of workers in reference to a 
5/40 schedule and later in reference to a flextime schedule. This study also 
demonstrated the potential feasibility of systematic assessment of attitudes 
toward work schedules, although it had several limitations. First, the scales 
appeared to be designed to assess attitudes toward current work schedules, 
which disallowed the possibility of anticipating reactions to alternative work 
schedules until workers actually experienced them. Second, this paper did 
not describe the development of the items, so it is difficult to determine 
which domain the 18 items represented: the researchers’ perspective, the 
subjects’ perspectives, or some other perspective. Finally, determination of 
the factor structure for the 18 items was based on only 50 subjects—a mar- 
ginal subject-to-item ratio at best. Thus, although this study demonstrated 
the importance and feasibility of assessing workers’ attitudes toward work 
schedules, further work was clearly needed. To the best of our knowledge, 
to date no one has further developed these scales. 

If an appropriate set of valid, reliable instruments were available to 

: assess attitudes toward specific work schedules, organizations would be able 
to evaluate the strengths and weaknesses of their existing work schedules 
and to compare them with the anticipated strengths and weaknesses of vari- 
ous alternatives. Such instruments could also be used to select, design, 
implement, and evaluate alternative work schedules. 

The primary purpose of our paper is to present a set of scales that can 
assess workers' attitudes toward almost any existing or alternative work 
schedule. We developed these scales during a three-phase research program 
involving six samples and over 700 clerical, technical, construction, menu- 
facturing, and health care workers working under a variety of schedules. A 
related purpose of the paper is to explore the predictive effectiveness of 
several sets of variables for anticipating workers' preferences of work 
schedules. Finally, we hope that the general design used here can serve as a 
model for researchers and practitioners pursuing instrument development in 
other behavioral areas. 


METHODS 
Study Design 


In phase 1 of the three-phase research program, the initial instruments 
were developed, administered, and tested for reliability and validity. The 
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instruments were refined in phases 2 and 3, and further reliability and valid- 
ity tests were conducted with additional samples. 


Participants 


Phase 1 employed three diverse samples—corporate office workers, trav- 
eling construction crews, and local repair crews and office workers—from a 
large midwestern utility company. These samples represented the popula- 
tion of workers for each of these three groups. Of a total 467 participants, 
approximately 23 percent were supervisory and 77 percent were nonsuper- 
visory employees. The individuals participating in phase 2—office workers 
from the same utility company— were randomly selected by department. 
This sample consisted of 28 supervisors and 104 nonsupervisors. At the time 
of the study, all participants in both phases were working under a fixed 5/40 
schedule. Phase 3 used samples from two county health departments at two 
times. The 156 nonsupervisory employees representing these two popula- 
tions included nurses, educators, sanitarians, clerical workers, and admini- 
strators. One of these samples (n = 57) worked a flextime schedule during the 
entire study period. The second sample (n = 99) worked a fixed 5/40 schedule 
at time 1 and a 4/40 schedule at time 2. 


Instrument Development 


On the basis of literature on work schedules and interviews with a wide 
cross section of 60 workers from the organization involved in phase 1, we 
constructed an initial set of 100 attitude-based items. Together with a 12- 
member labor-management committee from the utility company sample, we 
reviewed and revised the list to obtain 66 items. A priori classification 
identified 18 items that focused on attitudes toward current schedules, 10 
items that dealt with attitudes toward alternative schedules, and 38 items 
that related to perceived interference of work schedules with personal activ- 
ities. 

Instrument development in phase 2 built upon the findings from phase 
1. Two new items were written and other items rewritten so that the same set 
of questions could address either existing or alternative schedules. The results 
from phase 2 suggested a need for a more reliable assessment of the effects of 
work schedules on outside constituents such as customers or clients than we 
had. Consequently, we wrote three new items for phase 3 to try to gain this 
information. 


Data Collection 


Groups of employees, who received work release time to participate in 
the study, completed paper-and-pencil questionnaires. Subjects participat- 
ing in phase 1 answered questions regarding their attitudes toward their 
current 5/40 work schedule and about the level of interference with personal 
activities this schedule caused. In addition, each phase 1 group received the 
set of alternate work schedule items: the corporate office workers were asked 
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to react to 4/40, 4.5/40, staggered, and flextime schedules; traveling construc- 
tion crews reacted to 4/40 and staggered schedules; local crews and office 
workers reacted to 4/40, 4.5/40, staggered, and flextime schedules. 

In addition to the items related to work schedules, phase 1 participants 
answered questions relating to age, gender, educational level, marital status, 
number of children at home, personal security, company tenure, job involve- 
ment (Lodahl & Kejner, 1965), internal work motivation (Hackman & Oldham, 
1975), job satisfaction? (Dunham, Smith, & Blackburn, 1977; Smith, 1976), 
discretion concerning work schedules (Pierce & Newstrom, 1982), and a 
direct measure of preference of work schedules. 

Phase 2 participants answered the items dealing with interference with 
personal activities and the revised items on attitudes toward work schedules 
for both their existing 5/40 schedule and for a flextime work scheduling 
arrangement. Both samples of phase 3 participants completed the interfer- 
ence items and the second revision of the items on attitudes toward work 
schedules for both a fixed 5/40 and a 4/40 schedule. Two months later, we 
readministered the questionnaire to both phase 3 samples. 


Analyses 


Across the three phases of the research program, various analytic tech- 
niques examined the following characteristics of the instruments being 
developed: (1) dimensionality, (2) similarity of dimensionality across sam- 
ples and types of schedules, (3) internal consistency, (4) test-retest reliability, 
and (5) ability to predict actual work schedule preferences. Because of differ- 
ences in subject-to-item ratios, appropriateness of specific alternative sched- 
ules for particular groups of workers, and stages of scale development, we 
did not always use identical analytic procedures for all samples in each 
phase of the research program. 

To identify scale dimensionality at phase 1, we used principal compo- 
nent analyses with varimax rotations to try to find relatively pure sub- 
dimensions of the general construct. In phases 2 and 3 oblimax rotations 
further refined the several moderately correlated dimensions. At all three 
phases, we established underlying numbers of factors through combined 
use of scree tests and interpretability criteria. We calculated congruency 
coefficients (Harmon, 1975) to compare solutions across samples, types of 
schedules, and time. 


1 The questionnaire described flextime as follows: 
There are two primary features to flextime plans: 

(1) There are one or more blocks of time (core hours) each day during which all em- 
ployees must be at work. For example, all employees might be required to be at work 
between 10:00 A.M. and noon and between 2:00 P.M. and 3:00 P.M. 

(2) On a daily basis, each worker can decide when to work the remaining hours of the 
Work day. Thus, each employee is free to choose his/her starting time, break time, and 
quitting time as long as he/she works eight hours a day and is present during the core 
hours. 

? The short form of the Index of Organizational Reactions was used. 
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On the basis of the results of the component analyses, unit-weighted 
scale scores were constructed. We calculated coefficient alpha estimates 
(Cronbach, 1951) to assess the internal consistency for each scale and exam- 
ined test-retest reliability estimates for the phase 3 samples. 

As a partial test of the construct validity of the new scales, a nomological 
network was created including the constructs as measured by the new scales, 
actual work schedule preferences, and several other potential predictors of 
schedule preferences. Predictors included personal characteristics, job 
involvement, work motivation, satisfaction, and current discretion over work 
scheduling. With data from the phase 1 corporate office workers sample, we 
used a series of discriminant function analyses with classification analyses 
and multivariate omega-squared estimates (Tatsuoka, 1973) to approximate 
the nature, strength, and predictive accuracy of each set of variables in the 
network for the prediction of actual work schedule preferences. 


RESULTS 
Attitudes Toward Work Schedules 


Phase 1. The 18 items assessing attitudes toward the current work sched- 
ule and the 10 items directed toward each of the alternate schedules were 
subjected to component analyses with varimax rotations for the combined 
sample of 467 and for the two individual samples with sufficient subject-to- 
item ratios. Each analysis of attitudes toward current work schedules pro- 
duced a four-factor solution. Factor congruency estimates used to compare 
the similarity of the solutions across the samples ranged from .79 to .93 for 
the first three factors and between .39 and .73 for the fourth. Fifteen of the 18 
items subsequently defined four unit-weighted scales designated as personal 
effects, organization effects, coordination of work effects, and perceived 
uniqueness of work schedule. Coefficient alpha estimates ranged from .74 to 
.95 for the first three scales and from .59 to .69 for the fourth scale. 

Each analysis of the ten items on attitudes toward alternative work sched- 
ules suggested a two-factor solution. Factor congruency estimates for the 
solutions across samples and types of schedules ranged from .96 to .99 for 
the first factor and from .97 to .99 for the second. We used two of these items 
to define a unit-weighted scale interpreted as work coordination effects; four 
items defined a scale interpreted as personal effects; and four items failed to 
play meaningful roles because of low or complex loading. Coefficient alpha 
estimates for these two dimensions ranged from .80 to .96. 

The results of these analyses identified several shortcomings that caused 
us to revise the instruments, with help from the labor-management committee. 
Of particular concern were (1) low congruency and reliability estimates for 
the fourth current schedule scale; (2) the ability to identify only two dimen- 
sions for alternative schedules; and (3) a recognition that it would be very 
advantageous to have a single set of items for assessing attitudes toward both 
current or alternative work schedules. 
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Phase 2. As a result of the work done in phase 1, 20 revised items 
emerged as usable with any current or alternative schedule. In phase 2, these 
items were used to obtain reactions to both 5/40 and flextime schedules. 
Component analyses with oblimax rotations suggested very similar five- 
factor solutions for both analyses. Unit-weighted scale scores were formed 
for these dimensions, identified as: general affect toward schedule, work 
coordination effects, uniqueness of work schedule, family attitude toward 
schedule, and schedule’s effects on social and family life. Coefficient alpha 
estimates were very good for the. first three scales, ranging from .77 through 
.93. The fourth scale had an alpha of .78 for the 5/40 scale, but only .60 for 
the flextime scale. The fifth scale had low alphas (.40s) for both schedules. 

Two circumstances—the reliability problems identified in phase 2 and 
the inability of the instrument to identify adequately a dimension dealing 
with the effects of a schedule on outside constituents—led to the second 
major revision of these items and the research conducted in phase 3. Phase 3 
was also intended to extend assessment of the external validity and stability 
of the research scales. 

Phase 3. The second major revision of the scales on attitudes toward 
schedules resulted in a 23-item instrument. Two of the items in this set 
appeared inappropriate for respondents who were not married or who were 
living alone. These items stated “In general my family would be satisfied 
with my working..." and “My husband/wife would like me to work...." As a 
result, we decided to exclude these two items from analyses and from future 
versions of the scales. 

Component analyses with oblimax rotations for each of the phase 3 
combinations of sample, schedule, and time suggested that a six-factor solu- 
tion was most appropriate. Of the six factors (see Table 1), four were paral- 
lels of factors from phase 2. These were general affect toward schedule, work 
coordination effects, uniqueness of schedule, and schedule's effects on social 
and family life. In addition, two factors not defined in phase 2 were identified: 
effects on transportation and personal security, and effects on service to 
external constituencies.? 

Unit-weighted scales for the six dimensions across samples and types of 
schedules produced coefficient alphas that were for the most part at clearly 
acceptable levels (.70s and .80s). The major exception was the transportation 
and personal security scale, with alphas ranging from .29 to .90. Test-retest 
coefficients were also mostly strong (.60s to .80s) with a few exceptions. n 
particular, coefficients for schedule's effects on social and family life were 
poor, averaging in the .20s. 


Perceived Interference 


The 38 items assessing perceived interference with personal activities 
were analyzed for the combined sample from phase 1. The results of the 


3 The term external constituencies is somewhat awkward, but we use it here to deal with 
service to those outside of home organizations. This might include customers, clients, suppliers, 
regulatory agency representatives, and so forth. 
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TABLE 1 
Results of Principal Component Analysis? of Final Instrument 
Measuring Attitudes toward Work Schedules 





Factors 


Items” 





General affect toward schedule 


1. 


2. 


I would be very productive under 
the method of scheduling. 
The schedule would have 
a favorable influence on my over- 
all attitude toward my job, 


. My current work schedule would 


encourage me to do my best. , 


. Taking everything into considera- 


tion, I would be satisfied with my 
life in general while working a 
work week. 


. I would be dissatisfied with my 


current work schedule. 


. I personally would like the 


method of scheduling. 

A schedule would havean 
unfavorable influence on my phys- 
ical health. 


Service to external constituency \ 


8. 


The schedule would im- 
prove client access to the services 
of the Department. 


. The method ofscheduling 


would help the Department meet 
the needs of its clients. 


. The method of scheduling 


would make it easier for me to 
meet the service needs of the 
clients. 


. The methodofscheduling 


would hurt the quality of client 
service. 


Work coordination effects 


12. 


13. 





The methodofsclieduling 
would cause problems in coordi- 
nating work with my supervisor. 
The method of scheduling 
would cause problems in coordi- 
nating work with my co-workers. 


-53 


-12 


-10 


-01 


-07 


-01 


-20 


-04 


-10 
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TABLE 1 (continued) 





Items? 1 2 3 4 5 6 





Schedule effects on social and family life 


14. The workschedule would 
have an unfavorable influence on . 
my family life. 08 -03 08 86 05 04 

15. The method ofscheduling 
would make it easier for me to 
coordinate my schedule with the 
schedules of other family mem- 
bers. $ 

16. The workschedulewould 
have an unfavorable influence on 
my social life. -38 21 19 51 10 11 


Uniqueness of work schedule 


17. Almost none of my neighbors 
work a work schedule. -08 13 03 -10 85 12 

18. Almost all of my friends work a 
work schedule. -11 14 -07 -13 -78 20 


Transportation and personal security 


19. Working or leaving the work place 

at an unusual (early or late) hour 

would represent a personal secur- 

ity problem for me under the 
work schedule. 18 -09 -10 " 09 -03 89 

20. Under the method of sched- 

uling, transportation to and from 
work would be difficult. -31 10 25 01 -03 57 


Item not used in scales 


21. Overall, the method of 
scheduling would have more good 
than bad aspects. 37 53 11 -08 -04 02 





Decimal points omitted. 
bA 5-point response format was used, ranging from 1 = strongly disagree to 5 = strongly agree. 


principal component analysis with varimax rotations suggested three under- 
lying dimensions (see Table 2). Twenty-one of the items defined three scales: 
interference with activities involving family and friends; interference with 
access to services, events, and consumer goods; and interference with finan- 
cial and related activities. With one exception, coefficient alpha estimates 
for the three scales for the three samples and for the combined sample were 
very high (.81 to .93). 
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TABLE 2 


March 


Results of Principal Component Analysis: 
Interference with Personal Activities? 








Factors 
Items? 1 2 3 
Interference with activities involving family and friends 
1. Spendtimewithyourchildren. 84 33 14 
2. Maintain personal family relations. 76 30 22 
3. Spendtime with yourspouse. 76 29 24 
4. Spend time with friends. 75 35 17 
5. Takefamily onrecreational outings. 75 33 21 
8. Take partinchildren’s education. 73 20 31 
7. Attend children’s schoolactivities. 68 21 35 
8. Participate in sports with others. 66 33 27 
9. Participate in non-work organizations. 64 31 24 
Interference with access to services, events, consumables 
10. Seeamovie. 25 78 16 
11. Gototavern or buy liquor. 19 75 07 
12. Gotothelaundromat. 31 70 24 
13. Goouttoeat. 21 68 31 
14, Attend play, concert, cultural event. 41 68 19 
15. Watch favorite TV programs. 33 61 09 
16. Buygroceries. 81 58 33 
17. Goshoppingforclothes. 37 53 39 
18. Usepublictransportation. 23 45 21 
Interference with activities involving family and friends 
19. Gotothebank. 18 14 79 
20. Cashacheck. 10 18 75 
21, Gotothe postoffice. 30 07 65 
Items not used in scales 
22. Attend spectator sports. 43 47 29 
23. Gethaircutorstyled. 38 33 43 
24. Paybills. 24 49 53 
25. Getprescriptionfilled. 23 47 63 
26. Gotothelibrary. 21 41 54 
27. Visitdoctorordentist. 38 14 62 
28. Buygas. 33 45 38 
29. Continueyoureducation. 55 30 18 
30. Haverepairwork done. 57 09 43 
31. Visitrelatives. 63 39 11 
32. Doindividualsportsorhobbies. 68 84 31 
33. Attend family events. 56 35 17 
34, Gotothedrycleaners. 37 32 49 
35. Attendchurchorsynagogue. 41 58 10 
36. Doroutine workaround home. 68 24 37 
37. Dopart-time military activity. 46 38 25 
38. Spendtimealone. 37 35 19 


a Decimal points omitted. 


b Some items are abbreviated in this table; the complete items are available upon request. 


from 1 — very difficult to 5 — very easy. 


” Response format was 
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Prediction of Work Schedule Preference 


In the corporate office worker sample, several sets of potential predictors 
of work schedule preferences were used to predict workers’ actual first choices 
from a set of five work schedules that were under consideration. The sets of 
predictors included: personal characteristics, job involvement and motivation, 
job satisfaction, current discretion over work schedules, interference with 
personal activities, and attitudes toward schedules, as measured with the 
newly developed instruments. A series of discriminant function analyses 
determined whether a given set had a significant relationship with schedule 
preference, identified the percent of variance explained, and assessed classi- 
fication accuracy using the set of variables to predict schedule preference. 

Neither the job satisfaction nor the job involvement/motivation sets of 
variables had significant relationships with schedule preferences. Personal 
characteristics accounted for 3 percent of the variance with a 39 percent 
classification accuracy. Current discretion over the work schedule also 
accounted for 3 percent of the variance with a 30 percent classification 
accuracy. Interference from a work schedule explained 12 percent of the 
variance and had a 23 percent classification accuracy. The set of attitudes 
toward schedules explained 74 percent of the variance and produced a 67 

. percent accuracy in classification. The combined set of variables accounted 
for 85 percent of the variance and provided correct classification in 71 per- 
cent of all cases. 


DISCUSSION 


This paper reports the development and validation of a set of scales to 
measure workers’ attitudes toward various work schedules. This three-phase 
investigation resulted in the identification of scales to measure six attitudes 
toward work schedules: general affect toward a schedule; work coordination 
effects; uniqueness of schedule; schedule’s effects on social and family life; 
effects on transportation and personal security; and effects on service to 
external constituencies. In addition, we developed measures to assess per- 
ceived interference with three sets of personal activities: family and friends; 
access to services, events, and consumables; and financial and related 
activities. We found it particularly interesting that, in forming attitudes toward 
work schedules, workers focused not only on personal and family effects but- 
also on the consequences that particular schedules were likely to have on 
organizational effectiveness in terms of both work coordination and service 
to external constituencies. Interference caused by work schedules can be in 
one or more of the three distinct areas. We distinguished among these three 
different areas of interference because managerial actions taken to remedy 
associated problems may be very different, depending on where interference 
is most severe. 

The scales developed to measure each of the preceding dimensions have 
been shown to have adequate validity and reliability. With the possible 
exception of the uniqueness scale, the measure warranting the most serious 
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remaining questions about reliability, we believe that the scales developed 
here can be used with a wide range of employee types or schedules and can 
maintain adequate reliability and validity. Since the same scales can apply to 
many combinations of existing and alternative schedules, researchers can 
make direct comparisons for each of the attitudes across types of schedules. 

Placement of the attitude scales into a nomological network provided 
additional construct validity evidence. We showed that wide differences in 
preferences toward work schedules could not be explained entirely on the 
basis of employees’ characteristics or general affective orientations, as the 
nonempirical literature has often suggested. Attitudes must actually be 
measured; it is not adequate merely to draw inferences based on employee 
characteristics. Once schedule attitudes are assessed, we can predict work 
schedule preferences with high degrees of accuracy. Classification analyses 
conducted in these studies were able to predict work schedule preferences 
with an overall accuracy of up to 71 percent. The strongest predictors of 
work schedule preferences were specific attitudes toward work schedules— 
primarily affect—followed by personal characteristics, current scheduling 
discretion, and interference with personal activities. In many ways, it is 
these findings that most strongly support the construct validity of the mea- 
sures developed in these studies. 

An important remaining test of the utility of the scales developed here 
will involve using these scales to plan, implement, and evaluate alternative 
work schedules. This is being done in a series of studies currently nearing 
completion (Dunham, Pierce, & Castafieda, 1985). In one of these studies, 
the scales identified a flextime schedule as preferable to the existing fixed 
5/40 schedule. Using a quasi-experimental design, researchers used the scales 
prior to a work schedule change for both an experimental and control group 
and used them again three and six months after the introduction of flextime. 
The scales isolated a number of significant improvements in attitudes follow- 
ing the introduction of flextime. Perhaps more important, in terms of the 
validity of the scales, the pretest measures of flextime attitudes correctly 
anticipated subsequent reactions after flextime was actually introduced. In a 
second study to be reported by Dunham and colleagues (1985), the scales 
were used with a natural quasi-experimental design for a group of workers 
who changed from a fixed 5/40 to a 4/40 and back to a 5/40 schedule. Again 
the scales isolated a number of significant attitudinal changes, and again the 
investigators correctly anticipated these reactions prior to the introduction 
of schedule changes. 

Practitioners should find the instruments reported here of use in designing 
or modifying work schedules. More complete knowledge of workers’ com- 
plex attitudes toward work schedules will allow a much more systematic 
approach to work schedule design and redesign than has been possible in 
the past. The fact that workers' attitudes toward current and alternative work 
schedules focus both on the personal and organizational effects of these 
schedules suggests that workers' attitudes and opinions can be very useful in 
designing work schedules that benefit organizations as well as workers. These 
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scales can help organizations identify desirable alternative work schedules, 
anticipate probable positive and negative aspects of schedules, and systemati- 
cally evaluate the effectiveness of alternative work schedule interventions. 
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Introduction to Management 


Management: Concepts and Effective 
Practice, 2e by Michael A, Hitt, Texas 
A&M University, R. Dennis Middlemist, 
Colorado State University, and Robert L. 
Mathis, University of Nebraska, Omaha. 
Instructor s Manual and Test 
BankiTransparency Masters/Study 
GuidelVideotapelMicroTest-WESTEST /West 
Study. 600 pp, hardcover. 


Fundamentals of Production/ 

Operations Management, de by Harold 
E. Fearon, William A. Ruch, and Vincent G. 
Reuter, all of Arizona State University, C. 
David Wieters, New Mexico State University, 
and Ross R. Reck. 

Instructor's Manual/Test Bank. 180 pp, 
softcover. 


Organizational Behavior 


Organizational Behavior, 4e by Don 
Hellriegel, Texas A&M University, John W. 
Slocum, Jr., Southern Methodist University, 
and Richard W. Woodman, Texas A&M 
University. 

Instructor's Manual/Test Bank/Transparency 
Masters!Study Guide. 650 pp, hardcover. 


Organizational Behavior: Experiences 
and Cases by Dorothy Hai, University of 
Wisconsin/LaCrosse. 

To accompany Organizational Behavior, 4e. 
Instructor's Manual. 300 pp, softcover. 


Strategic Management 


The West Series in Strategic 
Management. Series includes: 
Strategy Formulation: Power and 
Politics, 2e by Ian MacMillan, New York 
University, and Patricia Jones, Management 
Consultant. 

Setting Strategic Goals and 

Objectives, 2e by Max D. Richards, 
Rollins College. 

Strategy Implementation: Structure, 
Systems, and Process, 2e by Jay R. 
Galbraith, Management Consultants LTD, 
and Robert K. Kazanjian, University of 
Michigan. 

Strategic Control by Peter Lorange, 
Wharton School, University of Pennsylvania, 
Michael F. Scott Morton, Sloan School, 
Massachusetts Institute of Technology, and 
Sumantra Goshal, INSEAD, Fountainbleau, 
France. 
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Macroenvironmental Analysis for 
Strategic Management by Liam Fahey, 
Boston University, and V.K. Narayanan, 
University of Kansas. 

Strategy Formulation: Issues and 
Concepts by Charles W. Hofer, University 
of Georgia. 

200 pplbook, softcover. 


Personnel 


Effective Personnel Management, 2e 

by Randall S. Schuler,New York University, 
and Stuart A. Youngblood, Texas A&M 
University. 

Instructor' s Manual and Test Bank/Study 
Guidel Software and PC Manual! 
Transparency Masters. 700 pp, hardcover. 


Case Problems in Personnel and 
Human Resource Management by 
Randall S. Schuler and Stuart A. 
Youngblood. 

Instructor's Manual. 325 pp, softcover. 


Productivity Through People by. 
William B. Werther, Jr., and William A. 


Ruch, both of Arizona State University, and 
Lynne McClure, McClure and Associates. 
500 pp, softcover. 


Organizational Theory 


Organizational Theory and Design, 2e 
by Richard L. Daft, Texas A&M University. 
Instructor s Manual/Test Bank. 571 pp, 
hardcover. 


Marketing 


Transportation, 2e by John Coyle, 
Pennsylvania State University, Edward 
Bardi, University of Toledo, and Joseph 
Cavinato, Pennsylvania State University. 
Instructor’ s Manual and Test Bank! 
Transparency Masters. 550 pp, hardcover. 


Analysis for Strategic Market 
Decisions by George S. Day, University of 
Toronto. 

Instructor' s Manual. 270 pp, softcover. 


For more information on these and other West texts, 
or to order an examination copy, write to 
West Publishing Company, College Division (KS), 
P.O. Box 64526, St. Paul, MN 55164-1003, 
or call toll-free 1-800-328-9424 
(in Minnesota and Canada call 612-228-2778). 


® 
West Publishing Company. 
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NEW FROM ELSEVIER! 


MANAGING PROJECTS: 
A Systems Approach 


Daniel D. Roman, Professor Emeritus, The George Washington University 


Today's business community is experiencing a growing trend to 
organize employees into small, entrepreneurially oriented groups. 
It represents a departure by management from function oriented 
activity to objective oriented activity in order to survive inin- ` 
creasingly volatile markets. Daniel Roman's unprecedented new 
book, MANAGING PROJECTS: A Systems Approach, provides a 
much-needed perspective on this trend by focusing on manage- 
ment within the project operational structure. The book explores 
the complete cycle of project activities from the conception of a 
project through its termination and evaluation. 


Based on the author's 30 years experience in teaching and in in- 
dustry, MANAGING PRO- 
JECTS will provide expert 
advice for the newly ap- 
pointed project manager, 
as well as invaluable in- 
sight for the experienced 
manager by covering 
the range of project ac- 
tivities in a variety of 
organizational set- 
tings. In addition, the 
book's compréhen- 
sive treatment of 
project manage- 
ment makes it an 
invaluable text- 
book for an area 
rapidly growing in 
importance. 


October 1985, 
442 pages, 
0-444-00966-3, 
` Cloth, $34.95 
(Dfl. 135 outside 
North America) 


Send orders to: 


In North America: i 
Elsevier Science Publishing Co.; Inc. 
PO. Box 1663, Grand Central Station, New York, NY 10163 


In the rest of the world: 

Elsevier Science Publishers, Direct Mail Department 

PO. Box 211, 1000AE Amsterdam, The Netherlands 

Note: Prices subject to change without notice. In NY State please add 


applicable sales tax. 
10/85 NH 167 
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Bigger doesnot alwaysmean better. 


The first computers were sobigthey treated that way. 





D A 


filled entire rooms, and had to be shut down So cali us at (513) 527-6384. And 
periodically because of overheating. remember that while some of the big, pro- 
Things got better when things got digious publishers may look impressive, they 
smaller. And that's something to consider if might not respond to your needs the way we 
vue looking for a publisher. will. The important thing about a publishing 


Now, we at South-Westem Publishing ^ company is not how big it is, but how big if 
would never suggest you select a publisher thinks, 
solely n 2 ve Ehre eeh 
suggest you select a publisher based on 
quickness of reaction fo your needs, person- SOUTHWESTERN 
alized service, and flexibility. Because your —— a 
book isnt like everyone elses. So it shouldnt be _______ COLLEGE DIVISION ` — 
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Better Business Begins 


in the Classroom... 
with texts from Merrill 


New for 1986 in Management... 


UNDERSTANDING ORGANIZATIONAL 
BEHAVIOR: A Managerial Viewpoint 

Robert E. Callahan, Seattle University 

C. Patrick Fleenor, Seattle University 

Harry R. Knudson, University of Washington 

420198-5: Casebound, $08 pages; Instructor's Manual, Test Bank 
UNDERSTANDING ORGANIZATIONAL 
BEHAVIOR: A Casebook 

Harry R. Knudson, C. Patrick Fleenor, Robert E. Callahan 
#20481-X: Paperback, 280 pages 

PERSONNEL: Management of Human Resources, 
Second Edition 

Michael R, Carrell and Frank E. Kuzmits, 

both, University of Louisville 

#28466-6: Casebound, 712 pages; Instructors Manual with Test Bank 
EXPERIENTIAL EXERCISES IN PERSONNEL/HUMAN 
RESOURCE MANAGEMENT, Second Edition 
Frank E. Kuzmits 

820424-0: Paperback, 256 pages; Instructors Manual 


FUNDAMENTALS OF STRATEGIC MANAGEMENT 
Fred R. David, Mississippi State University 
#20561-4: Casebound, 868 pages; Instructor's Manual, Test Bank 


STRATEGIC MANAGEMENT SIMULATION 
Kenneth B. Olm, George G. Eddy, Eugene Krus, Jr., 
all, University of Texas at Austin 

#20538-7: Paperback, 90 pages; Disk 


New for 1986 in Marketing... 


RETAILING, Second Edition 

Dale M. Lewison, University of Akron 

M. Wayne DeLozier, University of South Carolina 
#20474-4: Casebound, 768 pages; Instructors Manual, Test Bank. 
Student Guide, Transparency Masters 

RETAILING: Cases and Applications, Second Edition 
Dale M. Lewison and M. Wayne DeLozier 

#20558-1: Paperback, 276 pages; instructor's Manual 
DIRECT MARKETING 

Herbert Katzenstein and William S. Sachs, 

both, St. John’s University 

#20305-8: Casebaund, 480 pages; Instructor’s Manual 

with Transparencies, Test Bank 


New for 1986 in Finance... 


PERSONAL FINANCE: An Integrated 

Planning Approach 

Bernard J. Winger and Ralph R. Frasca, both, University of Dayton 
#20479-8: Casebound, 672 pages; Instructors Manual and Study Guide 

(by Paul R. Allen, Sam Houston State University), Test Bank. 

Transparency Masters, Software 


New for 1986 in Information Systems... 
BUSINESS APPLICATIONS SOFTWARE FOR 
THE IBM PCS 


Lon Ingalsbe, Portland State University 
#20476-3: Paperback, 350 pages; Instructor's Manual 
Also available by the same author: 
LOTUS 1-2-3? FOR THE ISM PC; 


{each with Instructor's Manual) 


COMPUTERS: An Introduction 

Donald D. Spencer 

#20559-X: Paperback, 352 pages; instructor's Manual, 

Computerized Test Bank, Transparencies, Software, Study Guide 
SYSTEMS ANALYSIS AND DESIGN: 

A Case Study Approach, Second Edition 

Robert J. Thierauf, Xavier University 

#20229-9: Casebound, 672 pages; Instructor's Manual 
INTRODUCING PASCAL: Workbook and Exercises 
Barbara Sparks Harris and Christine Benedyk Kay, 

both, DeVry/Chicago 

#20454-2: Paperback, 194 pages; Instructor's Manual 

THE ILLUSTRATED COMPUTER DICTIONARY, 
Third Edition 

Donald D. Spencer 

#20528-X: Paperback, 320 pages. #20529-8: Casebound, 320 pages 


Also new for 1986... 


THE LEGAL ENVIRONMENT OF BUSINESS: 
Commerce and Public Policy 

Charles R. McGuire, Illinois State University 

#20326-2: Casebound, 800 pages; Instructor's Manual with 

Test Bank, Student Guide 


FUNDAMENTALS OF BUSINESS STATISTICS 
John E. Hanke and Arthur G. Reitsch, 


both, Eastern Washington University 
#20333-3: Paperback, 400 pages; Instructor's Manual with Test Bank 


We've many more excellent texts for your courses... 
WRITE FOR YOUR COPY OF OUR NEW 1986 CATALOG! 


To receive your copy of our catalog, or to request an examination copy of any of these texts for adoption consideration, write: Merrill Publishing Company, 
ATTN: College Advertising, 1300 Alum Creek Drive, Box 508, Columbus, Ohio 43216. Or, call Chris Cole, toll-free, 1-800-848-1567; in Ohio, call 1-614-890-1111. 


MERRILL 


PUBLISHING COMPANY 
A Bell & Howell Company 
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Two New Editions from 
Rue & Byars 


Just Published 


MANAGEMENT 


Theory and Application 
Fourth Edition 


All New Content! 
All New Package! 


(Includes Color Transparencies) 
See what MANAGEMENT IN ACTION is all about! 





Leslie W. Rue, Georgia State University, and 
Lloyd L. Byars, Atlanta University 





Published Fail 1985 


SUPERVISION 
Key Link to Productivity 
Second Edition 


We've added Experiential Exercises to 


the New Leader in Supervision 
paperbound 


Examination copies for adoption consideration 
available on request. Please write: 


IRWIN 


1818 Ridge Road Homewood, IL 60430 


Irwin Management Texts for the Way You Teach. 
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ESSENTIAL REFERENCES 


From JOHN WILEY & SONS 


BANK STRATEGIC 
MANAGEMENT AND 


MARKETING 


DEREK F. CHANNON, Manchester Business 
School 


1-90383-3 248pp. 1986 $31.95 


CASES IN 
BANK STRATEGIC 
MANAGEMENT AND 


MARKETING 


DEREK F. CHANNON, Manchester Business 
School 


1-90884-3  480pp. 1986 $49.95 
MAKING DECISIONS 


Second Edition . 
D.V. LINDLEY 
1-90803-7 207 pp. 1985 $27.95 


QUANTITATIVE 
METHODS 
IN MANAGEMENT 


Case Studies of Failures and Successes 
Edited by C.B. TILANUS, Eindhoven Univer- 
sity of Technology, 0.B. DE GANS, Delft 
University of Technology, and J.K. LEN- 
STRA, Centre for Mathematics and Com- 
puter Science, Amsterdam 


1-90841-X — 304pp. 1986 $31.95 


OPERATIONS 
MANAGEMENT IN 
SERVICE INDUSTRIES 
AND THE PUBLIC 
SECTOR 


Text and Cases 

CHRIS VOSS, University of Warwick, COLIN 
ARMISTEAD, Manchester Polytechnic, 
BOB JOHNSTON, University of Warwick, 
and BARBARA MORRIS, Sheffield City 
Polytechnic 


7-90801-0  318pp. 1985 $24.95 


OFFICE 

AUTOMATION 

A Social and Organizational Perspective 
R.A. HIRSCHHEIM, Oxford University 
1-90909-2 336pp. 1986 $29.95 


OPERATIONAL 


RESEARCH 
D.J. WHITE, University of Manchester 
1-90717-0 — 361 pp. 1985 $42.95 


BRITAIN AND THE 


MULTINATIONALS 


JOHN M. STOPFORD, London Business 
School, and LOUIS TURNER, Royal Insti- 
tute of International Affairs 


1-90839-8  304pp. 1986 $24.95 


QUANTITATIVE 
METHODS FOR PUBLIC 
ADMINISTRATION 

IN DEVELOPING 


COUNTRIES 


SCOTT C. IVERSON, University of 
Washington 


1-90755-3  167pp. 1985 $34.95 


SURVEY DESIGN AND 


ANALYSIS 
FR. JOLLIFFE, Brunel University 
0-20272-6 ëm 1986 — $64.95 


To order, write to Julie Harrington, Dept. 6-0574: 


JOHN WILEY & SONS, INC. 


605 THIRD AVENUE, NEW YORK, N.Y. 10158 
OR CALL TOLL FREE: 1-800-526-5368. 


in New Jersey, call collect: (201) 342-6707. in 
Canada: 22 Worcester Road, Rexdale, Ontario 
MSW 1L1. 

Order Code #6-0574. 

Prices subject to change and higher in Canada. 


6-0574 
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ACADEMY OF MANAGEMENT 
1986 REGIONAL MEETINGS 


For more information or submissions contact the regional chairperson listed. 


SOUTHWEST—March 12-15, Hyatt Regency, Dallas, Texas. 
Linda Calvert, (713) 488-9420 
Department of Management 
University of Houston-Clear Lake 
2700 Bay Area Blvd. 
Houston, Texas 77058 


WESTERN—March 20-22, MGM Grand, Reno, Nevada. 
Patrick Connor, (503) 370-6444 
Graduate School of Management 
Willamette University 
900 State St. 
Salem, Oregon 97301 


MIDWEST—April 17-19, Park Terrace Airport Hilton, St. Louis, Missouri. 
Kenneth W. Wexley, (517) 353-5415 
Department of Management 
228 Eppley 
Michigan State University 
East Lansing, Michigan 48824 


EASTERN-—-May 7-10, The Hershey Hotel, Philadelphia, Pennsylvania. 
Michael Kavanagh, (518) 442-4929 
School of Business 
State University of New York at Albany 
Albany, New York 12222 


SOUTHERN-—November 12-15, Atlanta Hilton, Atlanta, Georgia. 
W. Alan Randolph, (803) 777-5966 
College of Business Administration 
University of South Carolina 
Columbia, South Carolina 29208 





EI 


ANNOUNCEMENTS 


National Academy 


Academy of Management 46th Annual Meeting 
August 13-16, 1986 


Hyatt Regency 
Chicago, Illinois 


For further information. contact: 


Don Hellriegel 
Vice President-Program Chair 
Department of Management 
Texas A&M University 
College Station; Texas 77843 
(409 845-1724 
a. 
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Membership 9 


Y 


Membership applications may be obtained by contacting: 


Joseph P. Yaney 
Director of Membership 
College of Business Administration 
Texas Tech University 
Lubback, Texas 79409-4320 


. Proceedings 


Copies of the Proceedings of the National Conventions may be ordered : 


from: 


Walter B. Newsom 
Business Manager 
Academy of Management Publications 
Mississippi State University 
P.O. Box KZ 
Mississippi State, Mississippi 39762 
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The Academy of Management Journal publishes articles in fields of interest to 
members of the Academy of Management. These fields of interest are reflected in the 
divisions and interest groups of the Academy listed on the inside front cover. 

The Journal publishes only original, empirical research as articles or research 
notes. Manuscripts are considered with the understanding that their contents have 
not been published and are not under consideration elsewhere. In its articles, the 
` Journal seeks to publish reports of research that develops, tests, or advances manage- 
ment theory and practice. All types of empirical methods—quantitative, qualitative, 
or combinations—are acceptable. The Journal does not publish purely conceptual or 
review articles; these are published in the Academy of Management Review. 
Atheoretical exploratory or survey research, methodological studies, replications or 
extensions of past research, and commentaries with new empirical content are also of 
interest for publication as research notes if they make an important contribution to 
knowledge relevant to management. 

Articles and research notes should be written so they are understandable and 
interesting to all members of the Academy. The contributions of research in all 
specialized areas to general management theory and practice should be made evident. 
Specialized argot and jargon should be translated into terminology in general use 
within the fields of management. 

Articles should also be written as concisely as possible without sacrificing mean- 
ingfulness or clarity of presentation. To save space, tables should be combined and 
data should be presented in the text wherever possible. Manuscripts submitted for 
publication as articles should not ordinarily exceed 30 double-spaced typewritten 
pages, including tables. Manuscripts submitted for publication as research notes 
should not exceed ten double-spaced typewritten pages, including tables. Everything, 
including tables, in submitted manuscripts should be typed in double-spaced format 
on one side of the page. Manuscripts prepared on computers should be printed on 
letter-quality printers or, if other printers are used, in double-strike or enhanced 
print. 

Decisions regarding publication of submitted manuscripts are based upon the 
recommendations of members of the Editorial Review Board and other qualified 
reviewers. All articles and research notes published in the Journal are subjected to 
the same anonymous review process. Reviewers evaluate manuscripts on their signifi- 
cance to the field, their conceptual adequacy, their technical adequacy, their appro- 
priateness, and their clarity of presentation. Reviewers' comments are made available 
to authors. J 

Submissions should be sent to Dr. Janice M. Beyer, Editor, Academy of Manage- 
ment Journal, School of Management, Jacobs Management Center, State University of 
New York, Buffalo, New York 14260.! Manuscripts should be prepared in accordance 
with the AMJ Style Guide for Authors, which is published periodically and is also 
available from the editor. Contributors should submit four copies of papers, retaining 
the original for their files. The Journal does not return manuscripts unless requested 
to do so. 


1 See From the Editor on p. 201 for new address to be used after August 1, 1986. 
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FROM THE EDITOR 


Because I will be joining the faculty at New York University in the fall, the 
Journal office will be moved in midsummer. After August 1, please send all 
manuscripts, revisions, and any other correspondence to: 


Janice M. Beyer, Editor 

Academy of Management Journal 

Graduate School of Business Administration 
New York University 

100 Trinity Place 

New York, NY 10006 
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The major objective of this study was to determine if Blau’s (1970) 
theory of structural differentiation better captures differences in scale 
between organizations or structural changes within organizations. Data 
gathered on 134 Ph.D.-granting universities over nine years permitted 
us to compare results of within-organizations tests of Blau’s theory with 
those from both cross-sectional and longitudinal between-organization 
tests. The findings suggest that the theory’s power lies in explaining 
structural differences between organizations rather than in explaining 
structural change. 


In an area that has been called a “theoretical wasteland” (Kimberly, 
1976: 573), Blau’s (1970) theory of structural differentiation in organizations 
provided a major theoretical statement concerning the effects of organiza- 
tional size on structure. This theory has inspired considerable research and 
debate (e. g., Mileti, Gillespie, & Haas, 1977; Miller & Conaty, 1980). The 
theory has also influenced many studies of formal organizational structure 
not directly concerned with testing the theory’s propositions, probably 
because of the emphasis given to organizational size and the causal priority 
the theory grants to size over other dimensions of structure. The large num- 
ber of citations referencing Blau’s initial theoretical paper evidences the 
extent of his theory’s influence. Current Contents/Social and Behavioral 
Sciences recently designated the 1970 article a “citation classic" (Blau, 1983). 
Moreover, interest has not waned; citations of the initial articles on the 
theory (Blau, 1970; Blau & Schoenherr, 1971) continue to be numerous more 
than a decade after their publication (Institute for Scientific Information, 
1984). 

In spite of the effect of the theory of structural differentiation on many 
current studies of organizational structure, even recent empirical tests of the 
theory have used cross-sectional data (Beyer & Trice, 1979). This situation is 
surprising since a small but growing body of general organizational research 
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has shown that cross-sectional differences in structure found among organi- 
zations do not necessarily reflect processes of structural change (Freeman & 
Hannan, 1975). Although a few tests of Blau's theory have used longitudinal 
data (Meyer, 1972), they merely used change-over-time as a variable in cross- 
sectional comparisons among organizations (Ford, 1980a). There are virtu- 
ally no within-organization, time series investigations of the basic relation- 
ships that Blau hypothesized. Thus, although his propositions concerning 
the relationships among size of organization, differentiation, and size of 
administration continue to influence organizational scholars, especially those 
concerned with organizational size, the proper sphere of the theory of struc- 
tural differentiation remains undefined. To what extent is it a theory of scale 
among organizations? To what extent is it a theory of change within 
organizations? 

In light of longitudinal research on organizational structure, it seems 

increasingly clear that both theoretical and empirical work should specify 
structural relationships that occur because of: (1) differences among organiza- 
tions in the scales of their operations, (2) differences among organizations in 
the degrees of change in their structures, and (3) changes within organiza- 
tional structures over time. Because Blau's work is perhaps the most influen- 
tial theoretical consideration of organizational size, differentiation, and size 
of administrative components, it seems appropriate to examine it in terms of 
these three issues. 
Thus, this study had two objectives. The first objective was to test the 
fundamental aspects of the theory using longitudinal data and a within- 
organization design. We used within-organization analyses to examine the 
theory's proposed relationships among the structural dimensions of size, 
differentiation, and administrative components over time and for individual 
organizations. The second objective was to compare these results with two 
types of models: (1) models like those typically reported for studies using 
cross-sectional data, and (2) models like those reported for studies using 
longitudinal data and between-organization designs. 


BLAU'S THEORY OF STRUCTURAL DIFFERENTIATION 


In his formal theory of differentiation, Blau (1970) inferred two general- 
izations from cross-sectional data on 53 governmental agencies and their 
local branches. The generalizations were: "Increasing size generates struc- 
tural differentiation along various dimensions at decelerating rates" (1970: 
204) and “Structural differentiation in organizations enlarges the administra- 
tive component" (1970:213). Blau operationally defined organizational size as 
number of employees, differentiation as number of formal structural compo- 
nents such as levels or divisions, and an administrative component as the 
number of staff personnel who provide support services for an organization. 

Blau derived numerous propositions from the two generalizations. These 
propositions hypothesized at least three basic relationships: (1) increasing 
organizational size reduces the relative size of an administrative component, 
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(2) increasing organizational size increases differentiation, and (3) increas- 
ing differentiation increases the relative size of an administrative staff. Blau 
(1972) later expressed the proposed empirical relationships in path analytic 
form, as shown in Figure 1. . 

Although Blau used cross-sectional data to derive his theory, the focus 
of his basic generalizations on changes in structural components implied a 
temporal dimension to organizational variability. However, the vast majority 
of research related to this theory has used cross-sectional data and examined 
only differences among organizations. As the literature reviewed in this 
study shows, the theory of structural differentiation is useful for explaining 
differences in scale among organizations. Yet we know little concerning its 
applicability to processes of change within organizations. As a result, the 
basic relationships of the theory still need to be examined over time for 
individual organizations if Blau's theory is to be considered a viable explana- 
tion of processes of organizational change. 


Issues Emerging from the Study of Change 


Research that examines Blau's basic hypotheses over time, especially 
with a within-organization design, should consider two issues that neither 
he nor the few longitudinal tests of his theory have addressed. 

The first issue is the lag structure, or the time period it takes relevant 
variables to respond to each other. Cullen and Baker (1984) demonstrated 
that the responsiveness of changes in size of administration to changes in 
enrollment varied widely in a random sample of colleges and universities. 
These findings were similar to those in other fields where within-subject 
time series analyses are more common (e. g., Assmus, Farley, & Lehmann, 


FIGURE 1 
Simplified Model of the Theory of Structural Differentiation 
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1984). Little is known about how long it takes differentiation to respond to 
changes in size or how long it takes administration to respond to changes in 
differentiation. More than just an empirical issue, investigation of lag struc- 
tures addresses an unspecified theoretical point in Blau’s work. All theories 
of change need to specify when one variable will respond to changes in 
‘another variable. Lacking such specification, any theory becomes irrefutable 
since hypothesized relationships will nearly always occur at some unspeci- 
fied time in the future. 

The second issue is how the theory of structural differentiation should 
be interpreted for declining as well as growing organizations. Although its 
originator phrased the theory’s generalizations and propositions in terms of 
growth, the derivation of the theory of structural differentiation from cross- 
sectional] data relies on an assumption concerning the results of earlier changes 
in structure. It implies that changes—growth, decline, or both—result in sim- 
ilar combinations of structural characteristics at particular levels of size at 
any time. Stated otherwise, "'[organizational] levels represent the sum of all 
change up to the point of measurement" (Dewar & Hage, 1978: 112). As a 
result, Blau’s theory at least implies hypotheses concerning decline as rever- 
sal of growth. If growth hypotheses are reversed, the following situations are 
necessary during decline in order for the sum of differences created by 
growth or decline to produce cross-sectional relationships consistent with 
the theory. 

Since relationships between size and differentiation and size and adminis- 
tration are not proportional during growth, they should not be proportional 
during decline. Blau’s theory states that during growth, decentralization 
occurs at a decelerating rate because the average size of the subunits increases 
(Blau, 1970: 206-207). This hypothesis says that there is absolutely more but 
relatively less differentiation with an increase in size. On the basis of this 
more general hypothesis concerning differentiation, Blau also hypothesized 
that the relative size of an administrative staff declines with increases in an 
organization’s size (Blau, 1970: 210). Because it is based on cross-sectional 
data, the theory of structural differentiation presumes that a decline returns 
an organization to an earlier state. Therefore, if proportionately larger adminis- 
trative staffs are expected for smaller organizations, the theory implies that 
in decline, administrative size decreases slower than organizational size. 
Similarly, differentiation should decline proportionately less than size 
declines. Because Blau did not specify whether the relationship between 
differentiation and administration is proportional during growth, support 
for his theory only requires that the form of this relationship be similar in 
growth and decline. 

Although the logic of the theory of structural differentiation has not been 
tested previously with growing and declining organizations, evidence from 
both between-organization (Freeman & Hannan, 1975) and within-organiza- 
tion (Ford, 1980a) longitudinal studies challenges this implicit hypothesis 
that growth and decline produce the same cross-sectional relationships among 
components of organizational structure. 
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In response to these issues, the within-organization models of the theory 
of structural differentiation used in this study (1) considered different lag 
structures for organization size, and (2) carefully examined differences 
between growing and declining organizations. 


PREVIOUS RESEARCH 


The following sections review selected previous research on the major 
relationships that Blau hypothesized. We differentiated studies reviewed 
along two dimensions: (1) whether they used longitudinal or cross-sectional 

data, and (2) whether their research designs were within- or between- 
organizations. The cross-sectional research we examined focused on two or 
more of the variables that figure in Blau's theory. Cross-sectional studies 
necessarily used between-organization designs examining differences among 
or between organizations. Longitudinal research having within- or between- 
organization designs examined two or more of the theory's variables over 
two or more time periods. Between-organization studies used longitudinal 
data to develop change variables for assessing variance in the theory's vari- 
ables across samples of organizations; within-organization studies examined 
variance in these variables over time for single organizations. 


` Cross-Sectional Studies 


Cross-sectional research examining the hypotheses posited by Blau essen- 
tially replicated his findings (1970, 1972, 1973). Several investigators have 
studied the relationship between organization size and structural differentia- 
tion. Most have found a positive. association. Goldman (1973) found strong 
relationships between organizational size and three forms of differentiation — 
spatial, horizontal, and vertical. Mileti and colleagues (1977) observed posi- 
tive curvilinear relationships between size and vertical and horizontal differ- 
entiation and a linear relationship between size and spatial differentiation. 
Miller and Conaty (1980) pooled data from three different sources: (1) feder- 
al and state agencies studied by Mileti and colleagues (1977), (2) department 
stores studied by Goldman (1973), and (3) an original heterogeneous sample 
contributed by Miller. They concluded that size is related positively to dif- 
ferentiation. In a study attempting a direct replication of the theory of struc- 
tural differentiation's hypothesis concerning differentiation, Beyer and Trice 
(1979) found size to have greater effects on division of labor, vertical dif- 
ferentiation, and horizontal differentiation than did Blau and Schoenherr 
(1971). 

Among the relationships postulated in the theory, the effects of size on 
administration have received the most attention; this was true even before 
Blau developed his theory. He hypothesized a positive correlation between 
organizational size and the absolute size of the administrative component 
and a negative relationship between organizational size and the relative size 
of the administrative component (Blau, 1970: 208). In their review of the 
literature, Jackson and Morgan (1978: 238) found unqualified support for a 
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positive relationship between the absolute size of both components. Bedeian’s 
(1980) review of 26 cross-sectional studies with data on organizational size 
and administrative ratios reported that 18 studies observed a negative rela- 
tionship between organizational size and relative size of administration. 
Gillespie and Mileti (1976) concluded from their literature review that, with 
size controlled, there are positive associations between the ratio of adminis- 
trators to organizational size and both vertical and horizontal differentiation. 
All of these findings are consistent with the theory of structural differentiation. 


Between-Organization Longitudinal Studies 


Most longitudinal studies investigating organizational structure employ 
between-organization designs. Such studies (e. g., Akers & Campbell, 1970; 
Freeman & Hannan, 1975) use changes observed within individual organiza- 
tions in comparisons across or between cases or organizations. For example, 
in a between-organization design with longitudinal data, once an organi- 
zation’s percentage of growth or decline has been measured, the analysis 
uses this variable—just like any other variable measured at one point in 
time—for estimating relationships across or between organizations. 

A small number of data points on which change can be measured and 
the variability of patterns of change within organizations make it difficult to 
determine how well relationships for change variables derived from between- 
organization research pertain to the same changes within individual organi- 
zations (Ford, 1980a: 617). For example, since between-organization studies 
usually apply one lag time to all variables and to all organizations studied, 
differences among the lag structures for individual organizations may obscure 
subtleties of relationships. Similarly, a single estimate of a relationship 
between change variables for an entire sample is sensitive to outliers in ways 
that may mask findings. It is possible, for example, that most organizations 
in a sample could have a relationship over time when a between-organization 
slope suggested no such relationship. 

However, since between-organization studies with longitudinal data do 
consider change and can contrast processes of growth and decline, it is unfor- 
tunate that only a few such studies exist—especially investigating the theory 
of structural differentiation or its constituent relationships. Using data col- 
lected at two points, Meyer (1972) confirmed his hypothesis that the relation- 
ship between size and differentiation was unidirectional; he found that size 
affects differentiation. In contrast, a later study by Dewar and Hage (1978) 
concluded that organizational size did not affect vertical or horizontal differ- 
entiation. 

In a six-year study that supported aspects of Blau’s theory, Hendershot 
and James (1972) found a negative association between organizational size 
(enrollment) and administrative intensity (the ratio of principals and supervi- 
sors to teachers} in rapidly growing organizations. However, they found no 
consistent associations for organizations that grew slowly. Daft and Becker 
(1980) hypothesized and found that organizational size had a positive rela- 
tionship with the absolute size of the administrative component. 
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Freeman and Hannan (1975) examined organizational size and size of 
the administration for both growing and declining school districts. The rela- 
tionship in growing districts between district size and supportive com- 
ponents—an inclusive measure of administration ——was positive, proportional, 
and relatively consistent with Blau's theory. Declining organizations removed 
members from their teaching or direct components faster than from their 
supportive components. Similarly, Freeman (1979) found a positive relation- 
ship between district size and size of the supportive components in both 
growing and declining districts. Enrollment had a stronger positive effect on 
supportive components in growing districts than in declining districts. 

If the cross-sectional differences found originally in the derivation of 
Blau's theory represent the results of changes that predated his analysis, the 
diseconomies in administration associated with small organizations should 
also be produced in formerly larger organizations moving into the smaller 
size because of decline. Thus, findings of decreased economies of scale with 
decline are logically consistent with the theory of structural differentiation, 
even though Blau did not deal with decline explicitly. However, critics have 
suggested that highly rational approaches to organizational change, such as 
Blau's, ignore the possibility that the political power of administrators 
becomes manifest during decline and is used to maintain the positions of 
those in power (Hannan & Freeman, 1978). 

Without direct study of the politics of decision making in an organiza- 
tion it is difficult to assess the degree to which rationality or political power 
prevail. Hannan and Freeman's (1978) findings did indicate indirectly that 
political power operates to increase retention of administrators in decline 
more than in growth. Ford (1980b) also argued that the relationships between 
size and various components of structure in growing organizations will not 
mirror those relationships found in declining organizations. During periods 
of decline, organizational decision makers may offer employees inducements 
to stay in an attempt to stem the processes of decline. Ford referred to this as 
the participation problem (1980b: 592-593). Commitment of an organization's 
decision makers to its status quo—their resistance to change-—may shape 
relationships in decline. Finally, like Hannan and Freeman (1978), Ford 
argued that political considerations during decline may lead to an organi- 
zation's members wishing to maintain their positions, thus limiting the 
organization's ability to adapt. 

Daft and Becker (1980) hypothesized that organizational complexity is 
related positively to size of the administrative component. They found a 
positive but weak association between complexity and changes in size of 
administrative components. Moreover, the relationship was reciprocal, with 
an additional weak but positive association between size of administrative 
components and changes in complexity. 

Although longitudinal research with between-organization designs has 
tended to support the major hypotheses of Blau's theory, these studies pro- 
vided less support than has cross-sectional research. 
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Within-Organization Longitudinal Studies 


A within-organization longitudinal design typically examines relation- 
ships over time for single organizations. Data are thus compared within 
individual organizations rather than between different organizations. Such 
studies then use regression models to explain variance in a time series for 
two or more organizational variables. Regressions for single organizations 
are usually compared descriptively. 

In spite of some early within-organization research (Tsouderos, 1955), 
longitudinal studies of organizational structure with within-organization 
designs are still rare (Kimberly, 1978). The difficulty of obtaining numerous 
data points in a time series probably accounts for this scarcity. We know of 
no within-organization studies of the theory of structural differentiation that 
have considered organizational size, differentiation, and size of administra- 
tion simultaneously. However, there is at least one recent within-organization 
study of organizational size and size of administration. 

Ford (1980a) collected data over ten years (1968-77) on 24 public school 
districts, hypothesizing little convergence between the results of cross- 
sectional analyses and the results of longitudinal analyses. He concluded 
that it was “problematic” (1980a: 627) to infer organizational processes from 
the results of research using cross-sectional designs. Ford used size elas- 
ticities—proportional changes over time—to assess administrative economies 
of scale in growing and declining organizations. For school districts with 
growing enrollments, he found diseconomies of scale for instructional 
administrators while approximately 50 percent of the school districts had 
economies of scale for noninstructional administrators. Most declining school 
districts added both types of administrators. Although the inverse of Blau's 
hypothesis of administrative economies of scale would suggest diseconomies 
during decline, these diseconomies should be slightly greater than propor- 
tional in order for the differences in scale between organizations to remain 
consistent with his theory, which states that small organizations have pro- 
portionately more administrators. However, the absolute increases in admin- 
istrative size during decline that Ford found are diseconomies exceeding 
those predicted by Blau; they perhaps result from the differential power 
of administrators (Hannan & Freeman, 1978), or from structural hysteresis 
(Ford, 1980b). 


Summary 


Although the investigations cited in the preceding sections represent 
only some of the research on the major variables of the theory of structural 
differentiation, they illustrate the basic trends in the findings for three differ- 
ent methodologies (cross-sectional, between-organization with longitudinal 
data, and within-organization analyses). Most cross-sectional research has 
tended to confirm the theory's major hypothesized relationships for size, 
differentiation, and administration. For growing organizations, within- and 
between-organization longitudinal research has tended to support the 
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hypothesis of administrative economies of scale. In addition, studies that 
have looked at organizations declining in size have often found support for 
the inverse of the hypothesis of economies of scale during growth; dis- 
economies with decline were found. However, these diseconomies have 
often been more extreme than those suggested by the theory of structural 
differentiation, with many organizations actually increasing the absolute 
sizes of their administrative staffs. The few longitudinal studies of differ- 
entiation, either as a predictor of administrative size or as a dependent 
variable to organization size, have been inconclusive in their assessment of 
Blau’s theory. 

It is noteworthy that there has been almost no research examining the 
three major variables of Blau’s theory simultaneously and longitudinally. 
Moreover, those longitudinal investigations that do exist have between- 
organization designs. Thus, to better understand the processes that Blau and 
others implicitly hypothesized to take place over time, further longitudinal 
research on the theory of structural differentiation seems necessary (cf. Meyer, 
1971, 1972). Cross-sectional research has already provided fairly strong sup- 
port for the theory. Results from within-organization longitudinal research— 
when contrasted with those from cross-sectional and between-organization 
longitudinal research—-can help determine the degree to which the theory ap- 
plies to changes within individual organizations or to differences among or- 
ganizations in the relative scales of their structural characteristics. The study 
reported here does just this by testing selected basic hypotheses of Blau's 
theory using cross-sectional models, between-organization longitudinal 
models, and within-organizational models. The within-organization models 
show internal change processes in individual colleges and universities over 
a nine year period. 


METHODS 
Population and Data Source 


Four-year universities that granted Ph. D. degrees were chosen as the 
population of organizations. We collected data on these organizations over 
nine academic years, from 1975-76 through 1983-84. For simplicity, we will 
refer to this period as 1975 through 1983. An original population of 251 
Ph. D.-granting universities appeared in the 1973 version of the American 
Universities and Colleges (Furness, 1973). Missing data reduced the number 
of organizations we studied to 134. As a result, there were 1,206 observations 
or data points (9 years x 134 organizations). 

The source of data was the Yearbook of Higher Education, volumes 7-15 
(Marquis Academic Media, 1975/76-1983/84). Observations began with the 
1975-76 volume because the Yearbook began reporting data consistently on 
faculty size during that year. Before that time, the only measure of size 
available was student enrollment. Since Blau's theory deals with differentia- 
tion in organizational workforces, data on faculty size were necessary to 
provide an appropriate measure. 
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Although published data sources are invaluable for providing time series 
data on organizations (Miller & Friesen, 1981), they provide few variables for 
introducing statistical control. We selected universities that granted Ph.D. 
degrees because we needed a population that was relatively homogeneous in 
organizational purpose but varied in size, differentiation, and administration. 
Public universities represented 55.2 percent of the data set. Relationships 
reported in the Results section were not changed substantially when con- 
trols were introduced for public auspices. 

The academic year was used as the period of measurement because ` 
universities normally adjust budgets, students, faculty, and administrators 
yearly. 


Variables 


In line with the theory of structural differentiation, variables represent- 
ing organizational size, size of administration, and structural differentiation 
were included in the study. As did Blau's (1973) study of colleges and 
universities, this test of his theory focused on academic workforces and 
academic administrative components. 

The measure of structural differentiation was the number of academic 
departments. In Blau's test of his theory with academic organizations (1973), 
he used number of academic departments to indicate academic division of 
labor. Because faculty comprise the primary workforce differentiated into 
academic departments by universities, the number of faculty was used as the 
measure of organization size. Full-time equivalent faculty was defined as the 
number of full-time faculty plus half of all part-time faculty. 

Although the Yearbook reports data on all types of administrators above 
the level of department chairpeople, many of these administrators deal with 
business, finance, or student personnel functions rather than with faculty in 
academic and teaching activities. Thus, we used the number of full-time aca- 
demic administrators above the level of department chairpeople to represent 
size of administration. Academic administrators were defined as those with 
direct or indirect authority over teaching or research functions. Independent 
classifications of administrators into academic or nonacademic categories on 
the basis of titles showed 91.7 percent agreement between two coders. The 
'senior author resolved discrepancies between coders. Academic administra- 
tors included presidents or chancellors; all academic deans; and all division 
heads for instruction, academic affairs, special research or teaching programs, 
adult or continuing education, and research. Titles for division heads included 
vice-president or vice-chancellor, dean, and director. 

Several arguments supported choosing academic administrators above 
the level of department chairpeople as the administrative category. First and 
most important in terms of the theory of structural differentiation, we assumed 
that academic administrators were the administrative component most 
directly linked to changes in academic differentiation and to changes in the 
size of academic workforces. Second, the most detailed data published in the 
Yearbook concerned academic administrators. Third, it is generally better to 
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investigate a homogeneous category of administrators since relationships 
between sizes of administration and other variables might differ among vari- 
ous possible categories of administration—an issue raised even before the 
publication of the functional theory of differentiation (Rushing, 1966). Fourth, 
although the Yearbook reported some data for administration in areas such 
as business and finance, these data alone were not sufficient to test Blau’s 
theory since no data were reported on either size or departmentalization of 
nonfaculty workforces. Fifth and finally, we examined academic administra- 
tion above the level of department chairpeople to avoid definitional depen- 
dency between size of administration and differentiation; such dependency 
is a common danger in relating components of organizational structure (Meyer, 
1971). Since departmentalization was the measure of horizontal differ- 
entiation, adding the number of chairpeople to the size of an administrative 
staff would have been virtually equivalent to adding number of departments, 
an independent variable, to size of administration, the dependent variable. 
The relationship between these variables would then have been biased. 

Although Blau's theory does not give theoretical prominence to one type 
of differentiation, it does hypothesize that differentiation occurs along two 
basic dimensions—the horizontal and the vertical. Academic departments 
exhibit two forms of horizontal differentiation—into subunits and into spe- 
cialized tasks. Previous researchers have noted both positive and negative 
simple correlations between horizontal and vertical differentiation (Beyer & 
Trice, 1979; Miller & Conaty, 1980). Positive relationships have sometimes 
disappeared (Blau & Schoenherr, 1971) when studies have controlled for size. 
One interpretation of such findings is that one form of differentiation may 
substitute for the other, especially within given size levels (Blau, 1973: 58), 
even though size shapes both forms. As a result, an inclusive test of Blau's 
theory with longitudinal data would require examining vertical as well as 
horizontal differentiation. Unfortunately, no information was available in 
the Yearbook for vertical differentiation or number of levels in the organiza- 
tional hierarchy of the university sample. 

This lack seemed less of a problem for testing Blau's theory with aca- 
demic organizations than it would have been for a test using another type of 
organization. Our basic reason for this assertion is that, perhaps for historical 
reasons, there is little variance and a restricted range in the number of hierar- 
chical levels in academic organizations. Blau's (1973) sample of universities 
and colleges showed a mean of 4.4 levels (s. d. — 0.7). Blau (1973) argued that 
most organizations had four levels (in terms of his measure): (1) faculty— 
chair, (2) chair— dean, (3) dean— vice-president, (4) vice-president— president 
(1973: 29-30). He noted further that “very few [organizations] have more 
than five or fewer than three levels" (1973:30). Given the nature of these 
data, it was not surprising that his test of the theory of structural differentia- 
tion (Blau, 1973: 54-58) explained considerably more variance in college and 
university departmentalization (R? — 0.73) than in number of hierarchical 
levels (R? — 0.36). Thus, although vertical differentiation in academic organi- 
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zations is associated with size, it does not seem to be their major form of 
differentiation. 


Procedures 


The analyses focused on the three basic relationships shown in Figure 1: 
(1) organizational size predicting size of administration, with differentiation 
controlled, (2) organizational size predicting differentiation, and (3) differ- 
entiation predicting size of administration, with organizational size controlled. 
When these three relationships are considered as a path model, two regres- 
sion equations are necessary to examine them: the regression of differentia- 
tion on organizational size, and the regression of administration size on 
organizational size and differentiation. 

Blau (1970: 210, 214) initially tested the hypothesis of an economy of 
scale in administration for the simple bivariate relationship between size 
and administration and for the direct effect of size on administration with 
differentiation controlled. The present study considered only the direct effect, 
as this was the procedure Blau used in his later work (1973) on colleges and 
universities. Moreover, this procedure is consistent with the equation sug- 
gested by Figure 1 when the figure is considered a path diagram. 

Within-organization analyses. We computed separate ordinary least 
squares (OLS) regressions over time for each organization. For each organi- 
zation, two basic equations were estimated to test the hypothesized within- 
organization (time series, t — years) relationships: 

DIFF, = SIZE, ; (1) 

ADMIN; = DIFF; + SIZE, Jj (2) 
OLS estimates were used because examination of the Durbin-Watson statis- 
tic showed only a small degree of autocorrelation. Moreover, parameter esti- 
mates are usually unbiased even with autocorrelated disturbances (Ostrom, 
1978). 

Following Ford (1980a), we did not use inferential statistics since the 
organizations studied were not a random sample, nor were the observation 
years sampled randomly over organizational life spans. Regressions over the 
observation years merely describe yearly changes in the characteristics of the 
observed individual organizations during the time of the study. We consid- 
ered Blau's theory confirmed or negated on the basis of two criteria, slope 
direction and magnitude. This procedure is not inherently biased for or 
against Blau's theory but only reveals whether the observed organizations 
behaved consistently with the theory from 1975 through 1983. We make no 
attempt to generalize to all academic organizations or to the entire life spans 
of the organizations examined. 

For two reasons, we transformed all variables to a proportional scale. 
First, the theory of structural differentiation considers proportional or rela- 
tive changes in relationships among variables, such as the administrative 
economy of scale during growth in size. Second, earlier within-organization 


1 DIFF = differentiation; SIZE = organization size; ADMIN = administration size. 
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studies (Ford, 1980a) have shown the value of using size elasticities to exam- 
ine the results of proportional changes in size over time. For both indepen- 
dent and dependent variables, we used a logarithmic scale of base 10 to 
assess the elasticity or responsiveness of a proportional change in one vari- 
able to a proportional change in another variable. 

Using proportional transformations on both sides of an equation makes a 
straightforward interpretation possible. A slope of 1 in a regression equation 
indicates that a change of one percentage point in the independent variable 
produces a change of one percentage point in the dependent variable. A 
slope of less than 1 indicates a proportionally smaller change in the depen- 
dent variable, and a slope of greater than 1 indicates a proportionally greater 
change in the dependent variable. For example, a positive slope of less than 
1 shows an administrative economy of scale for a growing organization. 
Such transformations have the additional advantage of showing proportional 
changes without using potentially suspect ratio variables (Freeman & 
Kronenfeld, 1973; MacMillan & Daft, 1979). Although many authors, includ- 
ing Blau (1970, 1973), used logarithmic transformations of size in estimat- 
ing the nonlinear or declining effect of size on untransformed measures of 
administration and differentiation, theoretical interpretation of regression 
slopes is more ambiguous with this method than it is when both sides of an 
equation are measured proportionally. 

Growing and declining organizations were analyzed separately, consis- 
tent with previous longitudinal investigations (Ford, 1980a; Freeman & 
Hannan, 1975). An organization was defined as growing if the slope of the 
regression of organizational size on the yearly observation dates was positive. 
A negative slope indicated a declining organization. This is essentially the 
same as comparing the mean size of an organization in the later years of the 
study with its mean size in the earlier years. 

Time lags for administrative changes following changes in organization 
size vary (Cullen & Baker, 1984; Freeman, 1979). Moreover, researchers in 
other fields have observed variable lag structures for organizational actions 
(Assmus, Farley, & Lehmann, 1984). Given these considerations, we exam- 
ined three lag structures for organizational size in testing Blau’s theory: (1) 
contemporaneous, with organizational size and the dependent variables, 
differentiation and size of administration, measured over the same years; (2) 
the one-year lag, with size observed one year earlier than the dependent 
variables; and (3) the two-year lag, with size observed two years earlier than 
the dependent variables. The following six equations were computed for 
each organization: 


DIFF77.33 = SIZE77.g3 f (3) 
DIFF77.33 = SIZE76-82 , (4) 
DIFF77-83 = SIZEz5.81 ; (5) 
ADMIN:-7.83 = DIFF 77.33 + SIZE;7.83 R (6) 
ADMIN: aa = DIFF 77.33 + SIZE76.82 > (7) 


ADMIN?77.33 = DIFF aa + SIZE75.51 . (8) 
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Because previous research has identified no best lag structure for the 
effects of organizational size on size of administration (Cullen & Baker, 1984), 
the within-organization tests of the theory’s basic hypotheses reported in 
this study were based on each organization’s best lag model for relationships 
of administration and differentiation with size. The following criteria were 
used to determine the best lag models for the relationships of organizational 
size with size of administration and differentiation. If the R? for a size lag 
(either contemporaneous, one-year, or two-year) was .1 or larger than the 
other two, we selected that model as the best lag structure for size. If the 
highest R? for any equation was not at least .1 or larger than the other two, we 
classified the lag structure for organizational size as ambiguous and used the 
contemporaneous model for analyses. If no variance in the dependent vari- 
able was shown, there was no relationship and hence no appropriate lag 
structure. 

Cross-sectional and between-organization longitudinal analyses. The 
same relationships tested within organizations were tested using cross- 
sectional models and between-organization longitudinal models. For illustra- 
tive purposes, we used 1977 and 1983 data for two cross-sectional tests of 
the theory of structural differentiation. Between-organization longitudinal 
models used the same two years as data points for the development of change 
variables. We chose 1977 and 1983 to facilitate comparisons with the within- 
organization analyses where variation in administration from 1977 to 1983 
represented the final variable in the tests of Blau’s theory. Multiple regres- 
sion equations representing the relationships posited in Figure 1 were used 
in both types of analyses for 134 organizational subjects. 

Variables were the same as those used for the within-organization 
analyses. However, three different measurement strategies were used to match 
the measurement approach used in this study’s within-organization analyses 
and measurement procedures found commonly in the literature. First, for 
both the cross-sectional and between-organization longitudinal analyses, we 
transformed all variables logarithmically to compare them directly with the 
within-organization models reported in this study. Second, we used abso- 
lute measures of all variables for consistency with previous research that has 
examined these variables in absolute terms. Third, we used ratio measures of 
administration size only in the two cross-sectional regressions to achieve 
consistency with much of the past cross-sectional research including Blau's. 
The Results section presents more details on these analytical strategies. 


RESULTS 
Within-Organization Analyses 


Of the 134 organizations in the sample, 96 were growing in size and 38 
were declining. Table 1 presents a cross-tabulation of the best lag models for 
organizational size by the two relevant relationships hypothesized in Blau's 
theory and by growth and decline. No one lag structure was found appropri- 
ate for size under conditions of either growth or decline. Separate chi-square 


217 


Cullen, Anderson, and Baker 


1986 


‘saseyjuared ur payrodex ore suorjezrueg1o jo srequinu aynjosqyq 

(OUO = d$ = fp'zgz = zX) drys 

"Uon[e1uomexnsturupe--— ezrs aq 10 eurpoop/q018 Aq sepour jo uonnqruisrp Teo = d'p = fp'zr'v = ;X) diysuonrjaz uonenuo9grTp «—ezrs au 10] 
eur[oap/qA018 Aq spepour jo uormqrnsrp "Topp = d'p = Íp'sc'8 = ,X) suonezruego Sotortoap ur sdrysuoyeyes ezrs Aq S[epour jo uonnqmsrp "Teen = 
d'y = fp'cg'v = ,XjsuonezrueS10 8umoI8 uy sdrqsuomne[e1 azts Aq spopour jo uonnqristp "Motto se pejndutóo aram 8188) arenbs-1y9 aje1edog, 








(zt) 9'te (8) T'rz (£) 8'4 (F1) e'o€ NIMGV «— IZIS 
(1) ez (Zt) ere (ot) Cas (01) eo (s) zer ddd ~— IZIS auyoəq 
(9) ca (6%) ITs (zt) gat (01) vor (61) ger NIWQV -— 4ZIS 
(4) EA (o£) «ze (61) ger (St) ger (6r) eer Addi ~~ IZIS ymor 






99UPLIPA i 
ON i 
qS[9poyA Se sag 
»9ZIS WWM sdrysuoneay Aq pue 
97IS ut aurp»aq pue uo Aq IZIS 107 sjapoyy Set 1seg jo saseyuao19g 
LTISVI 


z D 0 Sdtqsuone[osr a8ueyy jo uonparq 


218 Academy of Management Journal June 


tests on the cross-tabulations of lag models by different relationships within 
the growth and decline categories and on the cross-tabulations of the same 
relationships across growth and decline were not significant. 

Table 2 shows the distribution of regression slopes for the three relation- 
ships hypothesized in the theory of structural differentiation. For relation- 
ships in which size was an independent variable, data pertain to the best 
model, if one existed; data pertain to the contemporaneous model for rela- 
tionships with ambiguous lag structures for size. Where there was no vari- 
ance in the dependent variable, there was by definition no support for Blau’s 
theory. 

For growing organizations, the theory of structural differentiation 
hypothesizes absolute increases in size of administration and differentiation 
that are proportionally less than increases in organizational size. For declin- 
ing organizations, the theory hypothesizes—albeit implicitly—absolute 
decreases in size of administration and differentiation that are proportion- 
ately less than decreases in organizational size. Since this study computed 
regression equations separately for all growing or declining organizations 
and since the slopes of these equations represent the relationships between 
proportionately transformed variables over time, positive slopes of less than 
1 demonstrate support for the theory for either growing or declining or- 
ganizations.? Positive slopes of less than 1 for regressions of differentiation 
or administration on growth in organizational size indicate situations where 
these dependent variables grew more slowly than did organizational size. 
That is, for such organizations, growth resulted in proportionately less 
differentiation and fewer administrators. Positive slopes of less than 1 in 
regression equations for declining organizations indicate situations where 
administration size or differentiation declined more slowly than did organi- 
zational size; decline resulted in proportionately more differentiation and 
more administrators. 

Most organizations had positive slopes in the relationships of size and 
administration and size and differentiation regardless of whether they grew 
or declined during the study years. However, the forms of the relationships 
did not always follow expectations derived from Blau's theory. 

The regression of differentiation on increases in size resulted in positive 
slopes for 56.4 percent of the growing organizations. Positive slopes of less 
than 1 occurred more frequently (39.696) than did positive slopes greater 
than or equal to 1 (16.896), indicating that increases in differentiation often 
occurred, as expected, less-than-proportionally to increases in size. For orga- 
nizations declining in size, positive slopes indicated a reduction in horizon- 
tal differentiation for 60.5 percent of the organizations. For these organiza- 


? When regressions are estimated between variables over time, neither the direction nor 
magnitude of the slopes obtained show relationships with time; rather they represent the 
covariation between the. independent and dependent variables. For example, a positive slope 
for size of administration regressed on organizational size means only that these two variables 
grow or decline together, regardless of whether organizational size increases or decreases during 
the time period considered. 


219 


Cullen, Anderson, and Baker 


1986 





'seseqjuered ur paj1oder are suonezyuesio jo szequmu aynjosqye 


beggen ERE 








NIWGV -«— ddd 
NIWGV -— AHAIA 


NIWGV +— AZIS 
Aad -— IZIS 


NIWGV -— ZIS 
AHI -— ZIS 


sdrgsuonejoy 


UuOpnenuoiegtp ut auto 
per Ip ur eut[oeq 


UOTnernueisgrp ur ymor 
ezis ut sulfa 


9ZIS Ul mors 
asueyy jo uornoexiq 


(£) €'Z (91) 06€ (8) ger (Z) O41 (01) v'vz 
(2) ee (6) 9°6 (f£) oa (41) aT (82) z'oe 
(1) 9'z (01) voz (c1) ste (c) ter (OL) pas 
(1) os (z) v's (z1) s'te (v) Sot (61) one 
(9) £'9 (61) et (pz) o'sz LU zu (og) ete 
) ez (6) €'6 (92) ozz (91) got (8€) geg 
90UPIXPA LS t> t= I 
ON @anesan aantsog 
suonnqrsi adors 


Ls EE —— ————————————————————O EE EEN 


IPPON ur sdnjsuorje[os 10g 


eurpe( pue ymoIrg Aq sadojg uorssa1day uorjezrues10-ungtM 9Anesoaw pue aAnisog jo soSejues10q 


z TISV.L 


220 Academy of Management Journal June 


tions, differentiation changed most often with relatively less magnitude than 
did size—a finding consistent with Blau’s theory. However, in direct contrast 
to that theory’s expectations, approximately 40 percent of the organizations 
in this study either deleted departments during growth in size (shown by 
negative slopes), added subunits during decline (also shown by negative 
slopes), or did not alter their levels of differentiation in either a growth or 
decline. 

Only the 31.3 percent of the growing organizations with administrative 
staffs that increased less-than-proportionately to their growth in size showed 
the specific type of economy of scale predicted by the theory of structural 
differentiation. Blau (1970, 1972, 1973) predicted consistently that increas- 
ing size increases the absolute size of an administrative component while 
reducing its relative size. The 44.7 percent of the growing organizations with 
negative slopes for regressions of size of administration on organizational size 
showed gains in administrative economies, but these were through absolute 
reductions in the sizes of their administrations. Similarly, only 26.4 percent 
of the declining organizations had positive slopes of less than 1, indicating 
the expected less-than-proportional reduction in numbers of administrators 
during decline. The majority of declining organizations (57.9%) had propor- 
tionately larger administrative staffs, not because size of administration 
decreased more slowly than did the decline in size, but because the number 
of administrators increased absolutely. The negative slopes in Table 2 show 
this finding. 

To better understand the directional effects produced by regressions of 
size of administration on differentiation, we tested Blau’s hypothesis of a 
positive effect of differentiation on size of administration separately for orga- 
nizations undergoing growth or decline in differentiation. As with the growth 
and decline classifications for size, an organization was classified as growing 
in differentiation if it had a positive regression slope for differentiation on its 
observation years. A negative slope identified decline. Observation years for 
differentiation were 1977 through 1983. These years match those used for 
differentiation when it was a dependent variable regressed on organizational 
size for the same (1977-83) or earlier years (1976-82 or 1975-81) (Equations 
3-5 in the Methods section). The majority (69.496) of organizations grew in 
differentiation and the remainder declined. Slightly less than 50 percent of 
the organizations with growing differentiation had the expected positive 
relationship between differentiation and administration. In most of these 
cases, administration grew at a rate proportionately less than the rate at 
which differentiation grew. The majority of organizations with declining ' 
differentiation added proportionately more administrators in spite of decline 
in number of departments. 


Between-Organization Analyses 


For comparison with the within-organization analyses reported in the 
previous section and with previous between-organization research, data from 
two of the years covered by the within-organization analyses were used to 
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compute regressions representing relationships posited by the theory of struc- 
tural differentiation. We conducted separate cross-sectional tests of the the- 
ory with 1977 and 1983 data. These same years also provided the data for a 
test using two points of longitudinal data with a between-organization design. 
As noted in the Methods section, we used logarithmic transformations, abso- 
lute values, and ratio measures of administration to match both previous 
research and our within-organization analyses. Replication of both methods 
and measurement of earlier between-organization studies was necessary for 
at least one major reason: to the extent that this study’s between-organization 
findings are consistent with similar previous studies, they suggest that the 
data examined here are not unique. If they are not unique, a stronger argument 
can be made when within-organization analyses produce different results 
than between-organization analyses that cross-sectional comparisons among 
organizations do not reflect organizational change. 

Cross-sectional analyses. Cross-sectional models supported Blau’s the- 
ory to a greater degree than within-organization models based on the same 
data, as Table 3 shows. In two separate years, cross-sectional differences in 
size had a strong positive relationship with differences in differentiation and 
with differences in absolute sizes of administrative staffs. Unstandardized 
slopes for the equations in which size and administration were transformed 
logarithmically, shown in the column headed LNADMIN, demonstrated that 
proportional differences in size produced proportionately smaller adminis- 
trative staffs. Although positive, the slopes for LNADMIN were less than 1. 
The negative effects in the two equations of administrative ratios (the ratio of 
administration size to organizational size) on organizational size also con- 
firmed these findings. The regressions of administration size on the abso- 
lute amount of differentiation showed that organizations with more differen- 
tiation had more administrators even with size controlled. In addition, there 
were direct effects of differentiation on the administrative ratio. However, 
equations with logarithmic transformations indicated that the relationships 


TABLE 3 
Cross-Sectional Equations for Selected Years? 





Dependent Variables” 


Independent Variables Years DIFF ADMIN ENADMIN % ADMIN 
SIZE* 1977 .70 27 71 [.42] -.39 
1983 73 ER 61 [.36] -.35 
DIFF4 1977 — AT .02 [.02] -.18 
1983 — .33 .09 [.07] -21 


aStandardized slopes; unstandardized slopes are reported for transformed variables in 
brackets. 

bRegressions with administration as the dependent variable included both size and dif- 
ferentiation as independent variables. 

"Transformed to base 10 log for the regression of LNADMIN on size. 

4Transformed to base 10 log for the regression of LNADMIN on differentiation. 
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between the magnitudes of differentiation and administration were not 
proportional. 

Between-organization analyses with longitudinal data. Table 4 reports 
eight equations representing two approaches used to study the major rela- 
tionships posited by the theory of structural differentiation. The analysis 
employed a between-organization design with the 1977 and 1983 data. In the 
first approach, variables were transformed logarithmically. These transformed 
variables bear the prefix LN. The gain scores (e. g., LNSIZE3,; — LNSIZE;;) 
represent proportional differences in the between-organization equations. In 
the second approach, we used absolute numbers for simple gain scores (e. g., 
SIZEg3 — SIZE;;). As did previous research using gain scores or proportional 
change scores (Dewar & Hage, 1978), this study controlled for the initial level 
of the change variable. In addition, for both approaches the dependent vari- 
able was lagged so that the only potential variance to be explained was the 
change in the dependent variable that occurred between 1977 and 1983 (cf. 
Werts & Linn, 1970). - 

Naturally, the lagged dependent variable was the largest predictor in all 
equations. Otherwise, neither the levels nor the changes in the independent 
variables had appreciable effects on the changes in differentiation or 
administration. 

An examination of the distribution of slopes (Table 2) gives some indica- 
tion of possible reasons for the lack of effects produced by the change vari- 
ables {gain scores) in the between-organization longitudinal data equations 


TABLE 4 
Between-Organization Equations for Growing and Declining 
Organizations® 





(a) Growing Organizations 

(1) LNDIFFas = .91[.93] LNDIFF77 + .09[.07] LNSIZE;;—.00[.02] (LNSIZEss - LNSIZE;;) 

(2) DIFFas = .80 DIFFz; + .18 SIZE; + .05 (SIZEsa —SIZE;;) 

DI LNADMINs; = .83[.83] LNADMIN;; + .05[.03] LNSIZE;, + .03[.07] (LNSIZEa; — LNSIZE;;) + 
.03[.02] LNDIFF,, + .04[.11] (LNDIFFs4 - LNDIFF;;) 

(4) ADMINgs = .84 ADMIN: + .12 SIZE; + .06 (SIZEg3—SIZEz7)—.07 DIFF;;— 
.02 (DIFF,3;—DIFF77) 


(b) Declining Organizations 
(5) LNDIFF,, = 1.01[1.09]b LNDIFF;; — .07[.05] LNSIZEz; + .08[.36] (LNSIZEs; —LNSIZE;;) 
(6) DIFFs3 = .98 DIFF7; + .02 SIZE; + .06 (SIZEs; — SIZE;;) 
(7) LNADMINg; = .81[.70] LNADMIN;; + .01[.01] LNSIZE;; — .11[.32] (LNSIZEas -LNSIZE;;) + 
.04[.03] LNDIFFz;; + .13[.30] (LNDIFFs; -LNDIFF;;) 
(8) ADMIN: = .70 ADMIN;; + .12 SIZE; — .09 (SIZEs4—SIZE;;) + .01 DIFF;; + 
.01 DIFF; + .09 (DIFFs3 — DIFF;;) 


aMetric (unstandardized) slopes are shown in brackets. All other slopes are standardized. 

>The predicted slope fell out of the range of the data largely due to the presence of an out- 
lier and not because of a lack of variance in differentiation. Over 94 percent of the organizations 
had variance in differentiation over time. The mean change in differentiation between 1977 and 
1983 for declining organizations was loss of 2.3 departments. 
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(Table 4). Table 2 shows that many organizations had relationships that 
produced regression slopes in directions hypothesized by Blau’s theory. Yet 
there were almost always as many, if not more organizations, that had oppo- 
site effects. These results thus cancelled each other out in the overall esti- 
mates of change produced by the between-organization longitudinal data. 
Cross-sectional models that summarize estimates of change for a sample of 
organizations seem likely to show little or no effects when subsets of a 
sample show opposite processes of change. 


Conditional Relationships in Blau's Theory 


The divergent types of change shown by the within-organization analy- 
ses suggested that a search for moderating effects (conditional relationships) 
might be useful to specify further the conditions under which the theory of 
structural differentiation had more or less support. Exploratory analyses 
investigated the theory’s basic relationships for growing and declining orga- 
nizations that were (1) under either public or private control, (2) large or 
small, and (3) rapidly or slowly growing. In general, results did not show 
moderating effects except to suggest somewhat greater support for Blau’s 
theory under conditions of rapid growth. The sample size was not big 
enough to analyze the distribution of slopes across these classifications, 
however. 


DISCUSSION 


In general, cross-sectional relationships were consistent with previous 
research and supported the theory of structural differentiation. Longitudinal 
analyses failed to support the theory. A detailed consideration of the within- 
organization findings suggests some possible reasons for this lack of 
convergence. 

The major advantage of studying change using within-organization mod- 
els rather than between-organization models is that the within-organization 
models show the processes of structural change in detail. This level of detail 
allowed us to examine hypotheses derived from the theory of structural 
differentiation concerning growth and decline. These hypotheses were not 
stated directly in the original theory, but are implicit if Blau’s theory applies 
to change in general. The advantage of analyzing subtleties of relationships 
became particularly apparent when we considered the forms of relationships 
with growth and decline in both size and differentiation. 

A reduction in differentiation was expected with a decline in size. This 
occurred in over 60 percent of the universities. However, in a few it hap- 
pened proportionately slower than their declines, and some even added 
subunits during reductions in size. We expected administrative diseconomies 
with declines in size. This occurred for some universities (39.5%) through 
either greater or less-than-proportional reductions in administrative staff. 
However, many more added administrators during declines, perhaps to reform 
their organizations and stem the loss of students. Assuming that less differenti- 
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ation would reduce the need for coordination and control and hence reduce 
the need for administrative staff, we anticipated that declining differentia- 
tion would lead to decline in administration size. Again contrary to our 
expectations, over 50 percent of the organizations added administrators when 
their levels of differentiation were reduced. Essentially, Blau’s theory was 
not a good predictor of the changes that occurred among structural character- 
istics during declines of size for these organizations. 

For universities growing in size, neither differentiation nor size of 
administration behaved in line with Blau’s theory. Only a few increased 
differentiation and administrative size by less-than-proportional amounts. 
Many actually eliminated departments and administrators as a reaction to 
increases in size. Growth in differentiation, however, did lead to the expected 
increase in administrative staff for a majority of the organizations studied. 

The within-organization findings for administration also gave some 
indication about how cross-sectional differences may appear through pro- 
cesses unanticipated by Blau. Cross-sectional analyses have often found neg- 
ative relationships between organizational size and relative size of admin- 
istrative staffs. The theory of structural differentiation explains this finding 
as an economy of scale produced by less-than-proportional additions of 
administrators in growing organizations. However, within-organization analy- 
ses suggested that these differences arise over time through processes not 
considered by the theory of structural differentiation. The organizations stud- 
ied here often eliminated administrators during growth and added them 
during decline. Both processes will result in larger organizations having 
proportionately smaller administrative staffs, even though this result need 
not come about through the less-than-proportional administrative growth 
that is a major theoretical point of Blau’s theory. This illustrates at least one 
situation in which the results predicted by the theory can exist even when its 
explanation of processes is incorrect. We observed somewhat similar pro- 
cesses for the relationships of size and differentation. 

The findings for the within-organization analyses also raised an issue 
ignored by Blau’s theory. Using three time lags for organizational size, we 
found no dominant lag structure for the regressions of size of administration 
and differentiation on organizational size. It appeared that organizations 
responded with varying degrees of speed to structural change. Distributed 
lag models with several lags of size and differentiation included in the model 
simultaneously (Kmenta, 1971) may clarify patterns of response. It is also 
possible that lag structures for individual organizations are moderated by 
factors not considered by the theory of structural differentiation. 

Although these findings were derived from a homogeneous population, 
additional analyses not reported indicated that the findings are generalizable 
to all colleges and universities. An earlier phase of this research project— 
using a random sample of colleges and universities, a more inclusive mea- 
sure of administrative size, and enrollment as the measure of organization 
size—produced similar findings. 
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CONCLUSIONS 


A comparison of the findings for the cross-sectional analyses and the 
between- and within-organization longitudinal models suggests that the the- 
ory of structural differentiation offers better explanations of scale than of 
change. That is, the theory better explains differences in scale among organi- 
zations than it explains how individual organizations change. Cross-sectional 
regression equations showed strong support for most aspects of Blau’s theory, 
but both between- and within-organization longitudinal models suggested 
little support for it. Additionally, although within-organization analyses dem- 
onstrated that the vast majority of the organizations did change over time, 
the patterns of change diverged from those predicted by Blau’s theory. In 
essence, we conclude that these data fail to support the theory of structural 
differentiation as a theory of change. They suggest instead that it be consid- . 
ered a theory of scale. 

One explanation of why Blau’s theory can account for differences among 
organizations but not for changes within organizations may lie within the 
theory itself. Several of its propositions state that increasing size has a declin- 
ing effect on other variables—for example, on the administrative economy of 
scale. Given these propositions, it would seem that most organizations would 
reach a point where the declining effects of size would become negligible 
after a sufficient period of growth. As a result, differences in scale would 
remain among organizations after they grew to some absolute level of size, 
but future change would have little effect. In a sense, there would exist a 
threshold; at a given level of size, organizations would develop certain 
characteristics. Under such conditions, cross-sectional tests would confirm 
Blau's theory if most organizations observed had histories of growth—a likely 
situation for most cross-sectional studies because organizations with histo- 
ries of decline are not likely to have survived at the same rates as growing 
organizations. Population ecologists might also see other factors as determin- 
ing cross-sectional differences among organizations. Such factors as norms 
of organizational structuring at the time of founding and size at birth would 
account for differences among organizations that do not result from later 
organizational change. Thus, we suspect that the temporal wording of Blau's 
theory is misleading. It appears that reaching a certain magnitude or level of 
size, not changes per se, leads to marginal effects on differentiation and 
administrative economies. The data comparing the three basic approaches to 
the study of the theory of structural differentiation seem to support this 
contention. 

The theory implies rational organizational behavior by managers respond- 
ing to growth or to differences in scale among organizations: Increasing 
differentiation in response to increasing size presumably takes advantage of 
the benefits of specialization. More administrators are then necessary to su- 
pervise more subunits and to manage the increasingly complex integration 
of the units. Differentiation aside, increased size alone increases problems of 
coordination and control and in turn produces a need for a bigger adminis- 
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trative staff. However, once basic administrative functions are in place, or- 
ganizations only need to add staff marginally and thus achieve administrative 
economies of scale. 

Although our findings indicated that Blau’s theory did not explain the 
relationships observed, these relationships may have resulted from decision 
making equally as rational as that theory presumes. Adding administrators 
during a decline phase could be an attempt to offset decline with more 
administrative expertise. Declining universities might for example need more 
effort put into student recruitment and alumni solicitation. In addition, 
reduced departmentalization and resultant elimination of some department 
chairpeople necessitate new high-level administrators to perform administra- 
tive tasks formerly done within departments. Through reducing administra- 
tive overhead and numbers of departments, many growing organizations in 
our research sample were even more economical than anticipated. This high 

level of economy may be a reaction to the environmental changes of the 
1970s. Inflationary pressures and the knowledge that many colleges and 
universities were declining might have inspired organizational decision mak- 
ers to shape lean organizations ready to deal with actual and potential losses 
of resources. However, although changes in size alone do not increase 
organizational fragmentation or need for administrators, the logistics of man- 
aging more subunits does have its expected effect on need for administrators. 

Rational explanations for why the majority of organizations studied did 
not follow the processes of change implied by Blau's theory are essentially 
hypotheses of moderating or contingent relationships. Our results bring to 
mind earlier studies challenging the appropriateness of one model of 
bureaucracy, such as the Weberian, for all situations (e. g., Stinchombe, 1959). 
Rational reactions, in terms of administrative needs, to changing organiza- . 
tional size and differentiation may be consistent with the theory of structural 
differentiation under some conditions, but equally rational reactions may 
fail to support the theory under other conditions. Our brief exploration of 
moderating effects suggested that rapid growth leads organizations to behave 
in a manner congruent with Blau's theory. These organizations may be 
approaching the levels of size and differentiation that produce the associa- 
tions with each other and with size of administration observed with cross- 
sectional data and hypothesized by his theory. 

Of course, rational behavior does not completely govern organizations, 
nor do organizational decision makers accurately perceive their environ- 
ments in all conditions (Astley & Van de Ven, 1983). Misperceptions of an 
environment (Rubin, 1979), historical traditions (Clark, 1972), and internal 
and external political behavior (Pfeffer & Salancik, 1978) are all possible 
moderating or intervening variables that could alter the relationships hypothe- 
sized by Blau for size, differentiation, and administration. In organizations 
that are in decline or experiencing scarcity of resources, power rather than 
reason may have more influence on decisions regarding retention of adminis- 
trators or the elimination of departments (Ford, 1980b; Hannan & Freeman, 
1978; Pfeffer & Salancik, 1974). 
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Although the data examined here did not support Blau’s hypotheses 
when applied to changes within organizations, future replications of this 
research will be necessary for a more conclusive refutation of the theory of 
structural differentiation as a theory of change. The general lack of strong 
support for the theory shown by the within-organization analyses may indi- 
cate that longer lag periods should be investigated. However, as a theory 
derived from cross-sectional data, Blau’s theory fails to suggest appropriate 
lag structures. Thus, for future tests of this theory as a theory of change, 
researchers and theorists must specify the time frame in which they expect 
effects of changing size and differentiation to become manifest. Without 
such specification, it can always be claimed that an insufficient lag period 
was observed. This study suggests that for many organizations any effects of 
changes in size may occur, if at all, over the course of more than two years. 

This investigation was limited in focus and generalizability. We studied 
only the three major relationships hypothesized by the theory of structural 
differentiation and used only one type of organization. The within-organiza- 
tion analyses did not consider the theory’s hypotheses concerning span of 
control, homogeneity and average size of subunits, vertical differentiation, 
and feedback effects. As such, future within-organization research is needed 
to assess more fully the theory’s implications for change. In addition, since 
the results of this study pertain only to universities, future researchers should 
consider using other types of organizations. Such investigators might give 
special concern to vertical differentiation because academic organizations 
have limited variance on this variable. Attention might also be given to 
identifying organizational types with varied career options. Managers and 
administrators of nonacademic organizations, not having the typical 
academics’ option of returning to tenured teaching positions, might well 
behave unlike the academic administrators observed in this study—especially 
during decline. 

Finally, in more general terms, it should be noted that the lack of congru- 
ence between cross-sectional and longitudinal findings for Blau’s theory 
does not indicate that one type of research design is preferable, nor does it in- 
dicate that theories of scale are less worthy of concern. Rather, these findings 
highlight the need to develop theories of organizational structure that consider 
both magnitude of scale and degree of change in the variables of interest. 
Empirical tests of such theories will necessarily make comparisons between 
organizations as well as observe changes within organizations. 
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TACTICS OF IMPLEMENTATION 


PAUL C. NUTT 
Ohio State University 


This study identified implementation tactics used by managers in mak- 
ing planned changes by profiling 91 case studies. Analysis revealed 
four types of tactics, variations within these tactics, and hybrid tactics. 
These tactics are described and related to their success rates and condi- 
tions surrounding their use. The study found a 100 percent success rate 
when key executives used an intervention tactic, but observed this tac- 
tic in less than 20 percent of the cases. Both the persuasion and partici- 
pation tactics had 75 percent success rates; persuasion had the highest 
frequency of use, 42 percent, and participation the lowest, 17 percent. 
Both made high demands on resources. Implementation by edicts had a 
43 percent success rate and a 23 percent frequency of use. Intervention 
tactics and their variations were effective for all types of changes 
and under varying levels of time pressure and importance, suggesting 
that managers should use these tactics more often. 


‘Managers, responding to needs and opportunities, revitalize organiza- 
tions by setting in place planned changes that alter their organizations' 
products, services, internal operations, and policies. Researchers have given 
the process of planned change various labels: planning (Churchman, 1979a), 
organizational change and regeneration (Lewin, 1947), and decision making 
(March, 1981). Changes become the proposals for action that managers 
implement. Implementation is a series of steps taken by responsible organiza- 
tional agents in planned change processes to elicit compliance needed to 
install changes. 

Previous research has reported many implementation failures (Bardach, 
1977; Pressman & Wildavsky, 1973; Schultz & Slevin, 1975; Zand & Sorensen, 
1975). These failures can be attributed (1) to individuals or groups who 
attempt to maintain relationships. sustain existing procedures, or retain con- 
trol over valued activities that would be altered, and (2) to misunderstand- 
ings or disagreements about expected benefits (Greiner, 1970; Kotter, 1979; 
March, 1981). People in organizations use gesture, delay, and obstruction to 
contain or block change attempts they find threatening or merely disagree- 
able (Bardach, 1979). To be successful, managers must devise tactics that 
neutralize or at least contain people who delay making essential commitments, 
protect turf, posture, or carry out vendettas. 

This paper reports on a long-term research effort that documented the 
steps or tactics managers take to implement planned changes. The purpose of 
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‘this research was (1) to identify and describe the implementation tactics 
‘managers use, (2) to determine the relative success of these tactics, and (3) to 
identify conditions under which various tactics are used. Profiles of 91 case 

: studies of planned change provided data for this research. I identified imple- 
mentation activities in the cases, classified them according to implementa- 
tion tactics that had distinct procedures, and determined each tactic's suc- 
cess rate. Finally, I explored the relationships between tactics and contextual 
factors to determine if type of change, time pressure, quality of staff support, 
perceived importance, and size of budget for change processes led to the use 
of particular tactics. 


THE STUDY OF IMPLEMENTATION 


Implementation research has devised prescriptions based on logic, devel- 
oped administrative tools, studied factors that influence adoption, and 
explored how changes are put into effect. 

The prescriptive approach (e.g. Bennis, 1973; Kotter & Schlesinger, 1979) 
advocates the use of particular mechanisms to make changes. For example, 
Bourgeois and Brodwin (1984) presented commander, change, collaborator, 
cultural, and coercive models as alternative implementation approaches for 
executives. Nutt (1983) offered a contingency model that defined sixteen 
environments in terms of differences in climate and fitted implementation 
approaches described in the literature to each environment. The prescriptive 
literature has offered many useful ideas for implementation, but these ideas 
are based on logic rather than on data that supports the use of specific 
guidelines, recommendations, and assertions. 

The tool approach considers the value of techniques, such as t-groups, 
that deal with resistance to change and other issues related to implementation. 
For instance, Schein's (1961) studies of brainwashing during the Korean 
conflict demonstrated the value of breaking old social ties and establishing 
new ones in conversion attempts. Daft and Becker identified the roles of 
incentives and other factors that they call “enablers” (1978:8) in promoting 
implementation success. Such studies have identified useful ways for manag- 
ers to support implementation processes. However, the tool approach cre- 
ates the impression that implementation is nothing more than the selection 
and use of departmentalization, reward systems, coordination, and the like. 

The factor identification approach surveys managers to determine fac- 
tors that appear significant and correlates these factors with success and 
failure. For instance, Zand and Sorenson (1975) developed questionnaires to 
elicit managers' rankings of factors they believed to be important. Their 
questionnaire also sought to determine the extent of organizational compli- 
ance with changes, and of internalization or adoption of changes. Research- 
ers have. often identified factors in such questionnaires by stages in a pre- 
scriptive model; for instance, Zand and Sorenson used a model based on the 
work of Lewin (1947) and Schein (1961). Other studies of this type have built 
models that depicted steps believed to be essential in change processes and 
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used these models to elicit factors (e.g., Beyer & Trice, 1978). Such studies 
have found that success seems to be related to a positive balance in managers' 
feelings between desirable and undesirable factors such as confidence, fear, 
and understanding. They have also identified the importance of pivotal actors 
in change processes, and distinguished between intended and actual uses. 
However, Ginzberg (1975), Lucas (1975), and other reviewers have found 
little consensus among many well-executed studies as to which of the hun- 
dreds of factors that have been identified are crucial to success or precursors 
to failure. 

Others have used case studies to search for implementation approaches 
(e.g., Argyris, 1970). In fact, much of what we know about implementation 
comes from field studies of managers attempting to make changes. The ways 
managers carry out implementations reveal their procedures. The case study 
approach assumes that managers learn to be effective through “practice 
heuristics" (Schon, 1983:68) that evolve procedures to deal with implemen- 
tation arid other issues. However, interviews with managers have failed to 
surface explicit procedures (Keen & Scott-Morton, 1978), forcing researchers 
to use indirect methods, such as retrospective reconstruction of events. Investi- 
gators have studied successful and unsuccessful implementation attempts to 
try to derive the procedures used. For example, Pressman and Wildavsky 
(1973) followed the implementation of legislation to detect obstacles that 
were erected and the rationales behind these obstacles in order to suggest 
ways to overcome these barriers and enhance prospects for successful 
implementation. Bardach (1979) construed implementation as a game and 
described steps that promote winning. Many interesting concepts bave 
emerged from this type of study. However, as Keen and Scott-Morton pointed 
out, the idiosyncractic nature of case studies limits researchers’ abilities to 
pin down procedures underlying successful implementations. A much larger 
data base is required to understand the finer points of implementation and 
distill steps that can be unequivocally linked to success. Because of these 
limitations, implementation prescriptions offered by Greiner (1970) and oth- 
ers often take the form of reversing the steps used in failed change attempts. 

Implementation research has provided valuable guidance and identified 
pitfalls and opportunities, but has provided few insights into how managers 
carry out implementation. To extend implementation research, three changes 
in approach seem essential. First, the relationship between developmental 
and compliance-enhancing activities must be considered by researchers 
exploring formulation and goal setting, environmental assessment, option 
development and selection, evaluation, and other stages of change-making 
processes. Many process-based studies have failed to capture this relationship, 
thereby ignoring potentially crucial implementation steps that managers carry 
out during development. Second, investigators should frame change pro- 
cesses broadly so implementation activities can be seen as part of overall 
processes. The backdrop of a change process allows implementation steps to 
stand out in sharp relief, making it relatively easy to identify and classify 
activities undertaken by managers to elicit the support, cooperation, or 
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acquiescence required to install changes. Third, past studies of change failed 
to recognize that, although change processes may continue working their 
way through organizations, particular managers’ involvement with them has 
beginnings and ends. Fourth, some of the practices often used in case study 
research have serious limitations. Several studies (Bower, 1970; Mintzberg, 
Raisinghani, & Theoret, 1976; Soelberg, 1967; Witte, 1972) popularized a 
technique that immerses a researcher into the raw data describing each case. 
This type of research allows patterns to emerge by examining and comparing 
each case to the other cases. This approach is infeasible for large numbers of 
cases owing to the mass of detail that must be considered. The alternative of 
limiting data bases to make them manageable produces conclusions of sus- 
pect generalizability. Researchers using data bases with few cases may fail to 
discover important implementation-related steps or fail to recognize the idio- 
syncratic nature of the steps they do discover. Alternatively, reseachers can 
impose a framework to search for implementation steps instead of allowing 
patterns to emerge. However, imposing a framework may result in losing 
important messages that do not fit classification schema. In this research the 
advantages of systematic description and of using a large data base were 
thought to outweigh the possibility of lost information. 


THEORETICAL FRAMEWORK 
Implementation 


Managers use implementation to make planned changes in organiza- 
tions by creating environments in which changes can survive and take root. 
Planned changes are proposals for action that alter organizations’ practices. 
A change can be adaptive, calling for installation of known practices (Beyer 
& Trice, 1978); innovative, introducing practices that are new to adopting 
organizations (Damanpour & Evan, 1984); or radically innovative, introduc- 
ing practices new to all organizations in the same business or industry 
(Zaltman, Duncan, & Holbek, 1973). Changes can involve (1) technical inno- 
vations resulting from the application and use of technology; (2) admini- 
strative innovations dealing with relationships and promulgating new rules, 
roles, procedures, or structures within organizations; or (3) adaptation of tech- 
nology or managerial practices used by competing or cooperating orga- ` 
nizations. 

Previous research has defined implementation in terms of actors, intents, 
and procedures (Ansoff, 1984; Beyer & Trice, 1978; Daft & Becker, 1978; 
Zaltman et al., 1973). Implementation is a procedure directed by a manager 
to install planned change in an organization. There is widespread agreement 
that managers are the key process actors and that the intent of implementa- 
tion is to install planned changes, whether they be novel or routine. However, 
procedural steps in implementation have been difficult to specify because 
implementation is ubiquitous. Implementation has figured prominently as 
part of planned change processes labeled decision making (Mintzberg et al., 
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1976), strategic management (Schendel & Hofer, 1979), organizational regen- 
eration (March, 1981), change (Zand & Sorensen, 1975), legislation and pol- 
icy initiatives (Bardach, 1977), and diffusion of innovation (Waters, Salipante, 
& Notz, 1978). Ansoff made several important distinctions pertinent to these 
processes of planned change, identifying four procedures, called the entre- 
preneurial, exploration, control, and implementation subprocesses. From 
this perspective, implementation can be viewed as a procedure used in a 
planned change process that lays out steps taken to entice stakeholders to 
support changes. A coherent set of steps becomes a tactic used by managers 
to elicit the support, cooperation, or acquiescence needed to insure compli- 
ance with planned changes. Tactics should capture the arranging and 
manuevering steps that managers take to deal with social and political issues 
provoked by change attempts. In this study emphasis is on identifying and 
describing tactics and not on measuring behaviors and discrete events such 
as numbers of meetings, analysis of minutes, timing of change processes, 
or jockeying of stakeholders. 


A Transactional Path Model 


To determine how managers who sponsor changes promote compliance, 
this study examined how responsible agents regulate and control a process 
of planned change. Because steps taken to promote compliance can be found 
in any stage of a change process, this study treated each change process as a 
window through which implementation tactics can be viewed. 

The framework used to describe the planned change process was devel- 
oped by synthesizing stages from related processes (Nutt, 1984a). The recom- 
mended staging of activities was derived from the literature on innovation 
(Zaltman et al., 1973), policy formation (Dunn, 1981), organizational change 
(Hage & Aiken, 1970), planning (Nutt, 1984a), and decision making (March, 
1981). The specifications for stages and essential information in each stage 
were distilled for these processes.’ Figure 1 shows the planned change pro- 
cess as a series of transactions, instead of the static events often represented 
in similar research (Mintzberg et al., 1976). The arrows in Figure 1 show the 
direction of information flow between a sponsor-manager in the decision 
mode and a support team? in the developmental mode. Development can 
involve assessing environments to find niches ripe for exploitation; analyz- 
ing causal factors lying behind poor performance; offering ways to restruc- 
ture products or services that overcome performance problems; making lists 
of equipment or raw material vendors that meet preset criteria; providing 
assessments of potentially viable options like vendors, sites, programs, and 
so forth. This transactional representation has an action-taking character that 


1 Readers interested in a detailed discussion of how these stages map to the stages used by 
others are referred to Nutt (19842). 

2 Support teams can include technical staff, other managers, or even sponsors acting as their 
own technical advisors. The make-up of support teams working in the developmental mode can 
shift with every process stage, or a fixed relationship, such as sponsor-controller, can be main- 
tained throughout a change process. 
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FIGURE 1 
The Transactional Planned Change Process ab 
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b Arrows show direction and type of information flow from authorization, shown as circle, 
in the decision mode to development stages, shown as boxes, in the developmental mode 
of the transactional process. 


describes how managers intervene during development to promote imple- 
mentation and deal with other issues. The model is transactional because it 
identifies information that support teams report when carrying out each 
stage of a change process. Managers or sponsors receive this information and 
make decisions as change processes evolve. They take action through these 
decisions. 

Transactions. In stage I, formulation, a support team in the developmen- 
tal mode clarifies the needs and opportunities a manager stipulates in the 
decision mode. The support team identifies problems and proposes objectives. 
In stage II, concept development, a sponsor states his or her premises, which 
identify ways to deal with the problems or respond to the objectives identi- 
fied in stage I. The support team responds by offering one or more options, 
which the sponsor considers. In stage III some of the options are selected to 
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be detailed and offered to the sponsor, who tests them for omissions, 
misconceptions, and errors. Alternatives that can be modified to overcome 
the sponsor's objections are subject to evaluation in state IV. The sponsor 
identifies criteria and the support team applies them to assess the merits of 
alternatives that survive stage HI. In the installation stage the sponsor- 
manager applies tools such as rewards and incentives, personnel selection 
and promotion, resource allocation, sanctions, coordination, and delegation 
to put changes into operation. Stage V begins with an attempt to set in place 
a preferred option and ends when field performance is judged to be adequate. 

Paths. The path that transactions take through the model trace the steps 
in a change process. There are two types of traces (Nutt, 1984a:442). A 
normative trace identifies what theorists believe should take place during a 
change process (Churchman, 19792); it indicates information exchanges and 
directionality of the exchanges that theorists consider ideal, as shown in 
Figure 1. A descriptive trace profiles paths actually taken. Managers acting 
as sponsors can adopt active or passive roles in process management. A 
passive role would call for delegation. An active sponsor would participate 
in each stage of a change process, regulating it by acting on the information 
provided and offering guidance. Between these extremes lie a variety of 
management approaches. Patterns of information exchanges describe the 
degree and type of control used to promote compliance and show how man- 
agers deal with implementation-related questions. This study analyzed these 
patterns to identify implementation tactics. 


METHODS 


This research used a multiple case study approach to identify imple- 
mentation tactics because it captures the best features of both case study and 
factor identification approaches: firsthand knowledge and rich descriptive 
information, and the ability to quantify and generalize. The change process 
framework provided a common standard for representing cases. Cases were 
described using this framework to permit a search for tactics related to 
implementation. Information describing contexts was also collected. Contex- 
tual factors were used to specify the motivations and conditions associated 
with each case so implementation tactics identified in the study could be 
related to these motivations and conditions. Lastly, the study linked 
outcomes— success or failure—with tactics. 


Process Reconstruction 


The research identified the implementation tactics of managers by recon- 
structing the nature and sequence of activities found in the cases. This 
approach is similar to meta-analysis (Glass, McGaw, & Smith, 1982), which 
attempts to integrate findings from independent studies, using each study as 
a unit of analysis. In process reconstruction, cases are the units of analysis. 
However, process reconstruction differs from meta-analysis in that random 
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sampling of cases is not used because such an approach to case selection 
is not feasible. 

Process reconstruction required three key activities. The first was to 
conduct interviews with people who were intimately involved with the 
details of each change. These interviews identified critical steps, deter- 
mined the order in which these steps were carried out, and fitted sequences 
of steps to a classification framework to represent key transactions in each 
planned change process. The second step was to isolate activities under- 
taken to elicit or promote compliance in these transactions. The third was to 
analyze the content of cases with similar and dissimilar transactional 
patterns to distill these compliance-related steps into tactics and to break 
tactics down into categories called variations. 


Pattern Identification and Classification 


Identification of cases. Cases came from organizations that partici- 
pated in student intern programs. All were service organizations like hospitals, 
government or nonprofit agencies, charities, and professional societies, so 
the generalizability of this study's findings may only extend to such service 
providers. Because the organizations were widely dispersed across the United 
States and Canada, a broad range of planned change practice was likely 
represented.? 

A chief executive, operating, or financial officer (CEO, COO, or CFO) of 
each of the 91 organizations was contacted by the author and asked to partici- 
pate in a study that would profile one of their changes.* All agreed to 
participate. The sanctioning executives were asked to select a change that 
had considerable importance for their organizations; no other guidelines for 
case selection were offered. Allowing contact persons to select the cases 
helped to ensure that participants would be cooperative. When executives 
offered several options, change attempts that added diversity to the data 
base, both in terms of case types and outcomes, were selected. 

Case-related data collection. Multiple interviews were conducted in 
each organization to minimize memory distortion and memory failure, the 
two most common errors in reconstructing events (Bartlett, 1954). Each organi- 
zational contact named two people who were intimately familiar with each 
selected case for in depth interviews. One interviewee was the sponsor, the 
manager who had been administratively responsible for the change. Separate 
interviews with the sponsor and one other participating executive were 
conducted.? In the unstructured portion? of an interview, the interviewee was 


3 The cases in this study, as in most multiple case studies (e.g., Mintzberg & Waters, 1982), 
do not represent a random sample. A positive bias may be present because organizations that 
participate voluntarily are likely to share information about change attempts they believe were 
high quality. 

^ In the interviews the author asked s executive to identify-a project in which a change 
was attempted. 

5 The author pilot-tested the — in face-to-face discussions and then carried it out by 
telephone. 

$ 'The next section of this paper discusses the structured portion of the interview. 
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asked to spell out the sequence of steps that had been taken. The information 
gleaned from each interview was then summarized by the author using the 
transactional path model. A path through the model representing the sequence 
of events recalled by each executive was sketched to provide a picture of 
each planned change process. The resulting transactional description 
was then presented to that executive to verify that it accurately represented 
the process as he or she recalled it. The transactions and their path were 
modified until the process representation was acceptable to the informant. 
No more than a week was allowed to elapse between interviews and verifica- 
tion of corresponding path models. 

The processes sketched by the two executives were next E and 
differences identified. An additional interview reconciled any difference 
that occurred. In this interview, the executives were asked to modify the 
transactions until a consensual profile emerged." When differences persisted, 
others involved in the same change process were interviewed. This inter- 
view and other data sources like reports and records were used to determine 
which version of events seemed the most plausible. If disagreements per- 
sisted or if a case lacked detail, that case was to be discarded, but none were. 
All 91 cases were retained for the classification phase of the study and all 
met two tests: (1) participants agreed on what had been done, and (2) sufficient 
detail existed to explain what had been done. Data collection and reduction 
required about ten hours per case. Data were collected over a period of seven 
years. 

A sample profile. Figure 2 illustrates the type of detail captured in a 
transactional representation of a planned change with one of the cases 
in the data base. The direction of the arrows in Figure 2 shows the flow 
of information through the decision mode. The PBX case began when the 
organization's telephone operators filed a series of grievances concerning work- 
load (step 1). Discussion with Bell system technical staff revealed that the 
work demands made on the operators were 20 percent below that expected 
by Bell for their own operators. With this information in mind, the sponsor, 
who was the CEO, asked an operations research staff specialist to rectify the 

. Situation. The specialist could make any change that did not increase operat- 
ing costs (step 2). At the specialist's request, the sponsor delegated the choice 
of the terms of work to the PBX operators, making them a part of the decision 
mode (step 3). The specialist asked tbe operators to identify schedules, with 
the understanding that any schedule could be adopted as long as it did not 
increase costs. A queuing model was used to evaluate the cost of each pro- 
posed schedule in step 5. The seventh option proposed was found to have 
acceptable levels of both cost and satisfaction; no attempt was made to find 
the least-cost or optimal schedule. In step 6, this recommendation was pre- 
sented to the CEO. In step 7, a disgruntled supervisor who had not been in- 
volved, reluctantly put into effect the schedule the operators and management 


? When differences emerged, the executives were sent the path models and met to work out 
their differences. 
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had agreed on. After the new schedule was implemented the grievances 
stopped, ending the process in step 8. The manager-sponsor was satisfied 
when the grievances disappeared and expressed no interest in isolating their 
cause. l 

Sorting the profiles. In multiple case study research, reliable classifica- 
tion is crucial (Yin, 1981). In an attempt to identify categories that have 
classification precision, the 91 profiles or transactional path models were 
repeatedly sorted until distinct categories of implementation-related activi- 
ties emerged. To make these sorts, two criteria were applied: (1) convergence 
and divergence, and (2) the importance of the distinctions (Campbell & Fiske, 
1959). The convergence-divergence criterion was met when a classification 
had explicit similarities within categories and explicit differences among 
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categories. The importance dictum required that the distinctions among 
categories have practical as well as theoretical importance (McKelvey, 1978). 

The transactional path model for each case was reviewed and placed 
with others that used similar implementation steps. Profiles with ambiguous 
classifications were reviewed again in an attempt to create new categories or 
to match existing ones. After each sort, a code number designating a category 
was placed on the back of each transactional path model to identify it. The 
sorting process was repeated a year later without reference to the first sort to 
check intrarater reliability. These classifications matched perfectly. When a 
second rater sorted the profiles to check interrater reliability, agreement 
between raters exceeded 90 percent, suggesting that the categories identified 
were defensible. 


Measuring Effectiveness and Conditions of Use 


Data collected in the structured portions of the interviews with execu- 
tives allowed preliminary assessments of the merits of implementation tactics. 
Descriptions of the contexts of cases determined conditions under which 
each implementation tactic had been used. In addition, each case was classi- 
fied as to type and as to success or failure.? The data on contextual factors, 
types of changes, and success rates in Table 2 allowed a preliminary assess- 
ment of the effectiveness of tactics under the conditions present when the 
tactics and variations were used. 

A review of each case determined its type; three categories emerged from 
this analysis: In equipment cases, organizations acquired major pieces of 
equipment. Construction cases had the primary intent of altering the physi- 
cal features of key facilities. Program cases provided new services or internal 
operations and had less than one-quarter of their cost allocated to equipment 
or construction expenses. 

The remaining contextual factors were identified with a survey that 
participating executives filled out at the end of their interviews. The survey 
asked respondents to indicate whether change attempts had been adopted or 
rejected? and to specify values for the contextual factors. The survey asked 


8 This procedure assumes that a sponsor can define success without using criteria beyond 
“put into use." The cases bear out this assumption. For example, a new burn center was a failure 
if the center failed to open; a management information system was not adopted when the 
organization continued to use its manual system; and a merger was termed a failure when it was 
not completed. 

Measures such as frequency of use and extent to which changes become institutionalized 
could reveal insights into the degree of adoption. Attempts to collect information for these 
measures of status proved to be infeasible. For example, when a change in organizational 
structure was attempted, executives found it impossible to estimate how often the old reporting 
relationships had been used. To measure frequency of use would have demanded on-site 
observation, which was infeasible in a retrospective study. Measures of the degree to which 
objectives were met were also attempted. Initial and final estimates of performance gaps, such 
as cost or utilization expectations compared to realizations, were collected but proved to have 
questionable reliability. Records laying out expectations were seldom available making the 
proportion of objectives realized impossible to gauge. Subjective estimates always moved in 
favorable directions, even when changes were unsuccessful, making them seem suspect. Dis- 
agreements among the executives on these estimates were common and seldom fully resolved. 


1986 Nutt 241 


questions using anchored rating scales to determine importance, staff support, 
time pressure, and process budgets. Executives were asked to estimate a 
change’s importance on a scale ranging from 4 = critical to 1 = routine; the 
quality of staff support available on a scale ranging from 4 = good to 1 = poor; 
and the time pressure to complete the change on a scale ranging from 4 = very 
high to 0 = none. They were also asked to compare the funds allocated to carry 
out development with funds for other change attempts they had managed. 
This scale ranged from 5 = far above to 1 = far below. 

The estimate-discuss-estimate procedure was used to improve the preci- 
sion of these estimates (Gustafson, Shukla, Delbecq, & Wallster, 1973). First, 
executives filled out surveys. After tabulation, discussions of the average 
values for time pressure, need to act, staff support, and level of process 
support brought mutual understandings about the processes to the surface. 
After discussion, executives filled out the surveys a second time. The analy- 
sis used values from this second survey. 


Analysis 


Statistical analyses of the data treated two kinds of tactic variables— 
tactics and tactical variations, which were subclassifications within tactics— 
as factors. Levels for each of the tactical types were correlated with the other 
factors. Analyses related success rate, perceived importance, staff support, 
time pressure, and process budgets to the tactic variables. Ten one-way 
ANOVAs were performed, each using one of the two kinds of tactic variables 
and one of the context or outcome variables. Both parametric and nonpara- 
metric analyses followed an ANOVA format, using F tests and Kruskal- 
Wallis chi-square approximations, respectively. Similar significance levels 
for these tests would suggest that the values for the dependent variables meet 
interval-scale assumptions. Duncan multiple range tests were used to isolate 
differences in dependent variables associated with levels of tactic variables.!? 


IMPLEMENTATION TACTICS USED BY MANAGERS 


Several distinct types of implementation tactics and within-tactics varia- 
tions emerged from the analysis. The nature and degree of their sponsors' 
involvement in process management were found to be distinguishing fea- 
tures of these tactics. Four implementation tactics were used in 93 percent of 
the cases. These tactics were called intervention, participation, persuasion, 
and edict. Each tactic had important variations. Table 1 lists the frequency of 
use for each tactic and tactical variation and summarizes their salient features. 
The next subsection lays out the steps used in each of these implementation 


10 In a Duncan multiple range test (DMRT) the equivalent of multiple t tests are conducted 
for each pair of the levels of the independent variable to find tactics that have similar and 
dissimilar values for each dependent variable. An .05 level of significance is used. The DMRT 
controls for the increased probability of generating significant findings by running multiple 
comparisons. 
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tactics and their variations. Cases drawn from the data base illustrate their 
distinctive features, 77 


Implementation by Intervention 


The tactic. Implementation-related steps were found to begin and end in 
stage V for this tactic (Figure 3). Issues surrounding compliance, such as the 
arena of inquiry and ideas that must be considered to win over stakeholders, 
constrained activity in the developmental stages. In these cases, manager- 
sponsors were careful to acquire sanctions to activate and regulate the pro- 
cess, and to make it clear to all participants, such as other managers, who 
had control. 

To initiate change processes, the manager-sponsors became protagonists 
by creating rationales for action in the minds of key people. Appraising perfor- ` 
mance levels (Figure 3, step 1) and demonstrating performance inadequacies 





TABLE 1 
Features of Implementation Tactics 
Frequency 
of 
Tactics and Variations Occurrence Salient Features 
Implementation by . 
intervention 19 Key executives justify need for change. 
Norm test 5 New norms to judge performance 
Feasibility test 7 Ways performance can be improved 
Both 7 Both performance and norms 
Implementation by Stakeholder representatives determine 
participation 17 change features. . 
Token 2 Solution framing with partial participation 
Delegated 13 Solution specification with partial participation 
Complete 2 Solution framing with full participation 
Comprehensive 0 Solution specification with full participation 
Implementation by . 
persuasion 42 Experts attempt to sell a change they devise. 
Consultants 13 Persuasion by consultants 
Internal staff 29 Persuasion by organizational staff 
Implementation by edict 23 Sponsors issue directives requiring adoption. 
Change had sponsor 
significance 6 Sponsor has more at stake than the organization 
Change had organiza- : ] 
tional significance 12 Organization has more at stake than the sponsor 
Both organizational and 
sponsor significance 5 Both have clear-cut stakes 


2 Percentage of all cases. 


11 A list of each case in the data base classified by tactics, variations, and contextual factors 
can be obtained by writing the author. 
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FIGURE 3 
Implementation by Intervention: 
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by applying new norms (step 2) created these rationales. Sponsor-managers 
often cited comparable organizations with better performance to justify new 
performance norms. In addition, they used descriptions of how current 
practices could be improved to demonstrate the feasibility of changes in step 
2. These suggestions defined options that those involved in the change pro- 
cesses had to consider during development (steps 4 and 5). Many interven- 
tionists recognized that users were excellent sources of ideas, so during 
development sponsors often formed task forces made up of users to identify 
inefficient and ill-advised procedures. Task forces offered ideas or acted as 
sounding boards, providing commentaries on changes as they evolved. Spon- 
sors, however, retained the power to veto these recommendations. Change 
processes ended following demonstrations that adopting changes could 
overcome deficiencies in performance (step 6). Performance monitoring (step 
7) followed installation. ; 

In sum, steps in planned change processes using the intervention tactic 
are: 

1. Acquire the authority to manage a change process and appraise 

performance. 
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2. Apply new norms to identify EES inadequacies. 
3. (a) Justify the new norms, or 
(b) demonstrate the feasibility of improving practices. 

4,5. Development. 

6. Demonstrate improvement in performance. 

7. Monitor performance. 

The intervention tactic occurred in 19 percent of the cases, making it the 
third most frequently used approach to implementation (Table 1). 

Variations. Tactical variations arose in the degree of justification that 
managers used. The steps in justification involved validating new norms, 
showing that change was possible, or both. These variations on intervention 
tactics will be called feasibility tests, norm tests, and dual tests. 

Feasibility tests involved showing how organizational practices could 
be improved. For example, the manager-sponsor in one of the cases demon- 
strated how the organization's materials’ management procedures differed ` 
from up-to-date practices of inventory control. The feasibility test was used 
in 7 percent of all cases and in about 35 percent of the intervention-managed 
cases (Table 1). 

Norm tests occurred when new standards to judge performance were 
established. For example, a hospital CEO phased out a decentralized approach 
to the management of patient care. Under the decentralized system, an admin- 
istrator located in each hospital ward coordinated its use of supplies and of 
ancillaries like x-ray; its contact with admitting, billing, other wards; and so 
forth. To phase out this department, the manager-sponsor contended that the 
administrative costs of competitors were lower because they avoided dupli- 
cation of effort and diffused the accountability inherent in a decentralized 
system. The study found this variation in 5 percent of all cases and in about 
25 percent of the intervention cases. 

Some sponsors addressed both norms and ways practices could be 
improved. For example, fiscally concerned hospitals were wary of providing 
burn care services because third party payers, such as Blue Cross and 
Medicaid, did not cover all of the casts incurred. To overcome this objection 
one sponsor demonstrated how the cost of burn care could be covered by 
using a combination of sources—endowments, governmental care cost 
reimbursements, private insurance, and cheap resident manpower. This spon- 
sor integrated the demonstration of financial feasibility with a description of 
how successful burn units were operated. The dual test variation was 
observed in 7 percent of all cases and about 37 percent of the cases imple- 
mented by intervention. 


Implementation by Participation 


The tactic. To use participation, manager-sponsors initiated change pro- 
cesses by stipulating needs or opportunities, and then assigning decisions 
for developmental activities (Figure 4, step 1). For example, a sponsor might 
cite the need to reduce excessive sick time or retain accreditation or opportu- 
nities to centralize activities in space vacated by another department. Articu- 
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lating such stipulations (step 2) created objectives with varying degrees of 
explicitness, such as cutting inventory costs by 20 percent or merely reduc- 
ing these costs. In step 3, sponsors selected task force members to represent 
certain points of view or to give information thought to be important. Spon- 
sors delegated responsibility for guiding project development processes to 
task forces. Such delegation specified constraints and expectations and iden- 
tified support staff. For example, in the PBX case (Figure 2) the task force had 
latitude to select any schedule they wished, as long as support staff deter- 
mined that costs would not be increased. In steps 4 and 5, task forces arrived 
at consensus about specifics, such as protocols or procedures. In step 6, they 
made recommendations. The cooptation implicit in this delegation became a 
vehicle to promote compliance in step 7. The process closed with perfor- 
mance monitoring in step 8. 

Cases were classified as participative when task forces had the authority 
to carry out one or more stages of a process and when leaders could not 


FIGURE 4 
Implementation by Participation 
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veto the task forces’ views, as in the PBX case. Sponsors made support teams 
available to provide information requested by task forces. 
Steps in change processes using implementation by participation are: 
1. Sponsors stipulate needs, opportunities, or both. 
2. Sponsors set objectives. 
3. Task forces formed by sponsors who 
(a) identify stakeholders, 
(b) delegate responsibility with a statement of expectations and 
constraints, and 
(c) assign staff support. 
4, 5. Development. 
6. Recommendations made to sponsor. 
7. Acceptance promoted through coóptation. 
8. Monitor performance. 
Participation was the least frequent tactic, with 17 percent of the cases using 
it (Table 1). 
Variations. The extent of involvement and the role of task forces defined 
the key variations in participation-managed implementation, as Figure 5 
shows. Type 4, comprehensive participation, calls for delegation of develop- 
ment to fully representative task forces. This approach has the greatest 
cooptative potential owing to the breadth of the role and extent of participa- 
tion that it allows task forces, whose memberships include all important stake- 
holders, such as users and power centers. Comprehensive participation is 
similar to Likert's System 4 (Likert, 1967). According to Likert, this approach 
creates considerable commitment that makes implementation likely. None 
of the cases used comprehensive participation, suggesting that fully coop- 
tative approaches occur rarely, if at all, in practice. 


FIGURE 5 
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Type 3, complete participation, also calls for full participation but asks 
tasks forces to frame rather than specify developmental details. For example, 
-users might be asked to set directions or offer ideas, with staff specialists 
responsible for developing changes in line with the directions set and ideas 
offered. The Program Planning Method also calls for this type of participa- 
tion (Delbecq & Van de Ven, 1971). Complete participation seems less 
'cooptative than the comprehensive type because it restricts the role of task 
forces. Changes that had long-term effects and required broad-term commit- 
ments used this type of participation. For example, in an ambulatory surgery 
case, all users were involved in critiquing the proposed physical structure and 
procedures. Complete participation appeared infrequently—in 2 percent of 
all cases and just 14 percent of the participation cases. 

Type 2, delegated participation, uses stakeholder representatives to sug- 
gest solutions. Strategic planning groups with members drawn from organi- 
zations' boards of directors and key executives practice this type of partici- 
pation (Steiner, 1979). Much of the social planning initiated in the Great 
Society programs of the 1960s implicitly used delegated participation to 
develop or modify services (Nutt, 1976). In that era, user representatives 
were involved in planning processes to lay out details. The PBX project used 
this type of participation (Figure 2). Not all the operators were involved, but 
participants were allowed to dictate the terms of a new schedule. In cases in 
this study, organizations selected benefit packages for aging programs, de- 
signed laboratories, made contingency plans for strikes, carried out reno- 
vations, and planned service procedures in this manner. In delegated parti- 
cipation, participants were often coopted, but their ability to persuade others 
to go along with changes hinged on whether the prerogatives of nonpartici- 
pants seemed threatened. For exarnple, physicians rejected a seemingly in- 
nocuous plan for a pager system that colleagues had devised because it set 
a precedent of unilateral action. Thus, delegated participation may be less 
effective than the complete type. Delegated participation appeared in 72 per- 
cent of the participation cases and in 13 percent of all the cases (Table 1). 
Apparently most manager-sponsors believed that giving representatives re- 
sponsibility for specifying solutions was sufficiently cooptative to gain ac- 
ceptance. 

The least cooptative form of participation is type 1 or token participation. 
Both the use of representatives and their role seemed to hamper imple- 
mentation. For example, surveys may be used to frame users' problems. But 
problems can be misunderstood unless carefully developed sampling proce- 
dures are followed. If not, those surveyed may be unaware of stakeholder 
needs or misrepresent them (Nutt, 1976). Moreover, the members of repre- 
sentative task forces, although personally committed to changes, may fail to 
convince others. These two limitations suggest that token participation will 
be the least successful type of participation. The cases verified this specu- 
lation. Token participation was not sufficient to ensure adoption of any 
changes. For example, when the partners of a HMO plan disagreed about the 
hospital best suited to offer HMO services, Blue Cross—the most powerful 
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member—unilaterally selected a hospital to salvage the plan. Token partici- 
pation was infrequent, occurring in only 2 percent of all cases and 14 percent 
of the participation cases. 


Implementation by Persuasion 


The tactic. In implementation by persuasion, manager-sponsors made 
little effort to manage change processes and monitor their progress due to 
disinterest, lack of knowledge, or powerful or persuasive protagonists. These 
change processes began with either sponsors or experts stipulating needs or 
opportunities (step 1, Figure 6). In persuasion implementation there is little, 
if any, managerial review. Sponsors allow experts to control development. 
The path in Figure 6 moves around the periphery of the transactional model, 
illustrating the independence of experts, who stressed options and their 
justifications in steps 2, 3, and 4. Activities undertaken to pave the way 
for change were not observed. In step 5 experts presented products for 
approval. They made attempts to sell options that performed best, using 


FIGURE 6 
Implementation by Persuasion* 
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projected benefits to argue for adoption in step 6. Sponsors took passive 
roles, encouraging sales pitches from interested parties so they could weigh 
imperatives to act. Sponsors became active after sales pitches had been made, 
often demanding extensive documentation of benefits that required return- 
ing to the evaluation stage. Performance monitoring closed the process in 
step 7. 

Steps in change processes using implementation by persuasion are: 

1. Sponsors stipulate needs, opportunities, or both, or accept need 

or opportunity stipulations from experts. 

2, 3, 4. Development. 

5, 6. Interested parties use persuasion tactics to sell ideas. 

7. Monitor performance, 

Persuasion, the most frequently applied implementation tactic, occurred in 
42 percent of the cases (Table 1). 

Variations. The key variation in implementation by persuasion involved 
types of expert. Observation revealed two types of distinctions among experts. 
First was a distinction between content and process experts. Content experts 
were familiar with topics or systems; process experts sold procedures pre- 
scribing how to carry out processes and had no specific outcomes in mind. 
Content experts included consultants who sold turn-key plans to sponsors. 
Process experts were used when development was entrusted to organizational 
staffers thought to be skilled in procedures. This process-content distinction 
was often blurred in the cases, in part because sponsors encouraged experts 
to stress content (Nutt, 1984b). 

The second distinction was between consultants and internal staff. Inter- 
nal staff carried out development in 70 percent of the persuasion cases and 
29 percent of all cases, making this variation the most widely used of all the 
variations in implementation tactics. When internal staff carried out processes, 
sponsors believed their organizations had experts skilled in either procedure 
or content. For example, a sponsor asked a staff member to study existing 
family-practice outpatient programs and recommend one to their organization. 
Internal staff purchased equipment, developed management procedures, 
planned construction and renovations, and designed accounting and other 
systems. 

Organizations hired consultants to carry out processes in 30 percent of 
the persuasion cases. Consultants who were experts in specific content areas 
were used when manager-sponsors had little knowledge of or expertise in 
topics. Organizations also used consultants for projects that involved high 
technology, such as high-risk neonatology centers and information systems. 
Consultants with process expertise were used to develop construction plans 
and carry out specialized developmental activites, such as site selection and 
evaluations. 


Implementation by Edict 


The tactic. Implementation by edict involved sponsors’ using control 
and personal power while avoiding any form of participation. This tactic 
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had three key features. First, sponsors’ control of change processes was 
intermittent with no common theme. There was no recognizable pattern in 
sponsors' monitoring and no formal delegations of responsibility. Experts 
and users had little power. Second, sponsors did not discuss changes with 
users or attempt to rationalize the need for changes; they expected user 
compliance. They demonstrated neither imperatives for changes nor their 
feasibility. The justification steps found in the intervention tactic were not 
observed. Third, sponsors issued adoption directives by managerial fiat even 
when lengthy developmental processes occurred. Sponsors merely announced 
changes and prescribed expected behavior using memoranda, formal pre- 
sentations, or on-the-job instructions that dictated the expected behavior of 
users. Sponsors applying power in this way used implementation by edict. 

The steps associated with edict implementations in change processes 
shown in Figure 7 are: 

1. Sponsors stipulate needs or opportunities. 

2, 3. Development. 

4. Sponsors issue directives. 

5. Monitor performance. 
Edicts were used in 23 percent of the cases (Table 1).. 


FIGURE 7 
Implementation by Edict 
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Variations. The study identified three notable variations in the use of 
power tactics. These variations were based on whether changes had individ- 
ual or organizational significance, or both. Sponsors had personal vested 
interests in several of the edict-implemented changes. Some changes were 
identified with sponsors to the extent that failures to adopt them would have 
hindered these managers’ upward mobility. For example, a hospital CEO, 
hoping to alleviate disturbing malpractice incidents, announced a patient 
relations department by having the head of this new department make a 
presentation to key executives. Other sponsors saw themselves as acting in 
their organizations’ best interests by issuing edicts because they thought the 
changes had special significance. For instance, a sponsor used a memoran- 
dum to announce a new pricing policy for an emergency room, hoping to 
stem recent losses of business to freestanding urgent care centers. Still others 
believed that their interests and their organizations’ interests were syn- 
onymous. In the case of a helicopter transport system, for example, both the 
plans of a key competitor and the CEO's pet idea provided imperatives to act. 


In about 25 percent of the cases of implementation through edicts and in 
6 percent of all cases sponsors had more at stake than their organizations. 
Changes had clear-cut organizational significance with modest or minimal 
implications for sponsors in half of the edict cases and 12 percent of all 
cases. Organizations and sponsors shared equal stakes in changes in about 
25 percent of the edict-implemented cases and in 5 percent of all cases. 
Sponsors were found to have important stakes in half of the cases of imple- 
mentation by edict. 


Combined Tactics as Hybrids 


In hybrid cases, sponsors switched implementation tactics. Two or more 
of the basic tactics appeared in these cases. For example, some sponsors took 
responsibility away from experts, creating persuasion-intervention hybrid 
tactics. Other combinations included participation-intervention and per- 
suasion-participation. The tactics used, their order, and their relative emphasis 
all seem important. However, there were too few hybrid cases to show if the 
use of combinations meant managers were groping for correct tactics, with the 
last tactics used being their choice, or if the combinations themselves gave 
improved leverage. Hybrids may signal managerial errors or the emergence 
of different implementation needs at different stages of development. Only 
7 of 91 cases, or 7 percent of the cases in the data base, were classified as 
hybrids, suggesting that the type is rare. 


SUCCESS OF TACTICS AND CONDITIONS OF USE 
Adoption Rates 


The study determined tactic success by treating the tactic options in 
Table 1 as independent variables and adoption as the dependent variable. 
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ANOVA results indicated that both tactics and tactical variations were corre- 
lated with adoption (p < .005 and p < .05, respectively). 

The intervention tactic had a 100 percent success rate, followed. by par- 
ticipation with 84 percent, persuasion with 73 percent, and edicts with 43 
percent. A Duncan multiple range test found that the success rates for the 
intervention and edict tactics were significantly different. The persuasion 
and participation tactics were similar but distinct from the intervention and 
edict tactics (p « .05). This suggests that intervention has the best prospects 
of success, edict the worst, and persuasion and participation similar, 
intermediate prospects. The success rates for the tactical variations fol- 
lowed this ordering: Tactical variations for the intervention tactic had suc- 
cess rates that were better than or equal to those for variations on the partici- 
pation tactics; variations on participation tactics were better than or equal to 
the persuasion variations; and the persuasion variations were better than the 
variations on implementation through edicts. 

The most successful tactical variations were (1) the intervention varia- 
tions, including use of norm, feasibility, or both norm and feasibility tests, 
and (2) the tóken and complete participation variations (Table 2). If adoption 
is considered the measure of success, the intervention tactical variations 
seem clearly superior to all others. The success of token and complete partici- 
pation is difficult to interpret because there were only two cases for each 
type. Delegated participation, or solution-seeking by representatives, had 
success rates similar to those of the persuasion variations, according to the: 
Duncan multiple range test. This decline in rate of success suggests that 
participation may lose its cooptative power unless task forces involve most, 
if not all, of the people whom changes affect. When representatives were 
used, participation became much like persuasion, both in terms of approach— 
representatives selling changes—and likelihood of success. All of the edict 
variations fared poorly, with success ranging from 30 to 60 percent. 


Contextual Factors and Conditions of Use 


Table 2 shows the values obtained for the contextual factors sorted by 
tactic and variation. The analysis of these contextual factors was carried out 
to determine relationships between conditions surrounding changes and use 
of implementation tactics. Specifying these conditions helped to qualify the 
success rates observed for the tactics and their variations. In this analysis, 
perceived importance and time pressure associated with changes did not 
lead to the use of particular tactics or tactical variations, but budgets for 
change processes and staff support did seem to influence managers' selec- 
tion of tactics. f 

When variations of intervention or persuasion tactics were used, process 
budgets were above average (p < .02). Delegated participation, or develop- 
ment of solutions using representatives, had process budget demands below 
those of other changes. Both token participation, or framing of solutions 
using representatives, and implementation by edict for changes with high 
significance to sponsors also had low budget demands. Perhaps sponsors 
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kept the budgets of changes significant to them and implemented by edicts 
low to hide their interests. Large process budgets might draw unwanted 
attention. Cases of token participation may be pro forma attempts at 
involvement designed to minimize costs, which may explain their relatively 
low process budgets. More cases are needed to explain the high success rates 
and low costs associated with token participation. 

Cases that used complete participation, or fully representative solution 
framing, had budgets that were far above those of typical changes. Using task 
forces that were fully representative to define problems or comment on plans 
resulted in large budgets, but was effective, as the 100 percent adoption rate 
for this variation indicates. Costs of involving large numbers of people may 
account for these comparatively high process budgets. Delegated participation, 
with solutions framed by representatives, was less costly but also less effective 
because task force members had to resort to persuasion to sell their ideas. 
Persuasion by several task force members succeeded only a bit more often 
than persuasion by single experts, as the data in Table 2 verifies. 

The quality of staff support was related to the tactical variations (p < 
.05). Staff support perceived to be satisfactory to good characterized all but 
organizationally significant edicts and token participation (framing with 
representatives). Respondents designated support as marginal for organiza- 
tionally significant edicts and as just below satisfactory for token participation. 
This suggests that sponsors using edicts may underestimate the need for 
good staff support when changes have organizational significance but do not 
when changes have personal significance. The personal involvement of spon- 
sors heightens attention and entices them to recruit capable people to help 
carry out changes. When such heightened interest was lacking, sponsors 
used staff thought to be less capable. 

Sponsors seemed to underestimate needs for staff support when they 
used delegated participation, or representatives devising solutions. They 
may have believed that task forces were self-sufficient and failed to see the 
need for staff to support development of solutions. The demands of solution 
development were high here compared to the demands created by framing 
solutions in token and complete participation. 

The study also classified changes as equipment, construction, and pro- 
gram types, as discussed earlier in this paper. Equipment changes often 
captured the attention of key administrators because they typically cost 
$500,000 or more. Development focused on determining the value of equip- ` 
ment innovations or updating. Construction projects involved changes in the 
physical features of key facilities. Development stressed physical features 
and rationales that supported needs to alter, add to, or build new facilities. 
Large budgets of 10 million and more and significance throughout organiza- 
tions were typical. The category program change described installing new 
services or internal operations that involved minor equipment and construc- 
tion expenditures, if any, of less than a quarter of the budget. Most program 
changes had only incidental equipment or construction costs. The emphasis 
was to modify or devise new institutional practices that developed software, 
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procedures, protocols, and the like for services like detoxification units, or 
for internal operations, like security programs. These changes stressed pro- 
cedural development and allocated the bulk of their budgets for this purpose. 

Program changes were the most common, representing 60 percent of the 
cases. About 25 percent were construction changes and 16 percent were 
equipment changes. Unlike Damanpour and Evan (1984), this study found 
no relationship between implementation tactics and type of change. Manag- 
ers do not appear to favor particular tactics to implement program, equipment, 
or construction changes. 


CONCLUSIONS 


This research led to several insights about the practice of implementa- 
tion that offer guidelines for managers and suggest topics meriting further 
research. Key conclusions concern implementation practices and the effec- 
tiveness of implementation tactics under particular conditions of use. 


The Nature of Managerial Implementation 


The intervention tactic. The intervention tactic is similar to the change 
agent prescriptions of Lewin (1947) and Schein (1964), with some important 
distinctions. In practice, change agents were manager-sponsors who took con- 
trol of planned change processes. Consultants were never observed. Steps 
were similar to the Lewin-Schein implementation procedure of unfreezing, 
change, and refreezing (Dalton, 1970). However, managers in this study were 
found to be far more aggressive than organizational development (OD) spe- 
cialists acknowledge as either necessary or desirable. Behaving in line with 
the prescriptions of Kotter and Schlesinger (1979), successful implementers 

` carefully monitored entire change processes, regulating and controlling social 
and political issues as they arose. Managers using the intervention tactic 
were quite good at creating new norms in systems they sought to change. 
They offered new definitions of acceptable performance, justified these new 
norms, and showed how practices could be improved. Through these steps, 
sponsors were able to stipulate needs, thereby eliminating ambiguity, deal 
with scale, self-educate, collect essential data to deflect threats, manage 
resentment, build confidence, and reinforce what was wanted. These are all 
key forces identified by Zand and Sorensen (1975) in their study of imple- 
mentation in the Lewin-Schein process frame. Only one of the managers had 

‘any exposure to the OD literature, suggesting that these practices evolved 
from practice-based heuristics, as suggested by Schön (1983). 

The participation tactics. The origins of participation as an implementa- 
tion tactic can be traced to the classic study by Coch and French (1948), who 
found that people reacted more favorably and became more committed when. 
they participated in change-making processes than when they did not. This 
tactic mandates forming task forces that represent key stakeholders. Manager- 
sponsors delegate their development prerogatives to the task forces. The 
Delbecq, Van de Ven, and Gustafson (1975) approach to planned change is 
typical of many contemporary approaches that make these prescriptions. 
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Managers who used participation tactics were aware of the participation 
literature and often said that they used participation because of this 
knowledge. However, the study observed no instance of full or comprehen- 
sive participation and thus cannot specify the details of such a tactic. 
Typically, managers scaled down the scope of participation. Representatives 
acted as agents for stakeholders in more than 85 percent of the participation- 
implemented projects. In those few cases where full involvement occurred, 
participants had narrow roles. They commented on issues that helped frame 
problems by interpreting their own needs or critiquing proposed solutions. 

The benefits of participation seem contingent on what task forces are 
asked to do and the scope of the involvement of key people. Representatives 
can help, but the power of cooptation drops dramatically for those not directly 
involved. The effects of cooptation decline both when representatives of a 
large group of stakeholders are involved and when the role of a task force is 
limited. All tactics but comprehensive participation have these limitations. 
However, comprehensive participation, or full representation of stakehold- 
ers in devising solutions, was not observed and may seldom, if ever, arise in 
practice owing to its cost. Even the complete participation type, full partici- 
pation of stakeholders for a part of the process, can be difficult to carry out 
because affected parties are often numerous and diffuse. The parties involved 
in a prison siting are an example. The leverage from delegated participation 
stemmed from representatives, but the number of individuals affected by the 
change that could be involved limited the power of cooptation. Participation 
should be more carefully defined in future research to account for participants' 
roles and the extent of their involvement. : 

The persuasion tactic. Operations researchers and others who call for 
experts to guide development activities in planned change processes advo- 
cate a sales approach to implementation. For example, Churchman (1979b) 
argued that implementation depends on anticipating and countering moral, 
aesthetic, religious, and political objections to proposed changes. According 
to these views, implementation should hinge on experts who determine 
what should be done and use rational arguments to convince managers to go 
along. Organizations like the National Science Foundation, the Ford Foun- 
dation, and the National Institute of Health apply this type of implementa- 
tion in their peer review procedures. Experts select the grants and contracts 
they will support—their products. The controversial Operation Research Soci- 
ety of America guidelines, published in 1971, also called for experts to 
vigorously sell ideas they find desirable. As a result, many of the planned 
change processes devised by management scientists and operations research- 
ers have experts recommending and managers reacting. Implementation is 
seen as an educational process (Churchman, 1975). When changes are not 
adopted, failure is often attributed to managers who did not understand 
developmental processes or their products (Churchman & Schainblatt, 1965; 
Duncan, 1974). 

Sponsors extensively delegated development to experts. More than 40 
percent of the cáses used expert-managed development processes. Managets 
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gave experts broad responsibility, but withheld approval until the experts 
could demonstrate value. This forced experts to devise well-constructed 
rationales. The frequency with which persuasion tactics were used to justify 
decisions to adopt changes suggests that many sponsors see development as 
an activity that internal staff or consultants should carry out. Forcing experts 
to gather arguments for adopting changes at the expense of gaining the ac- 
ceptance of stakeholders may account for many of the failures of implementa- 
tion by persuasion. 

Future research should consider how consultants influence persuasion- 
based processes. Sponsors can use consultants or internal staff, but seem to 
prefer internal staff. Consultants play vital roles in many organizations, but 
their activities and the benefits they create are shrouded in secrecy. More 
study of consultants and their role in implementation seems essential. 

The edict tactic. Issuing edicts requires sponsors to use one or more of the 

reward, legitimate, expert, informational, and referent power types identi- 
fied by French and Raven (1959). The authority to offer rewards or to remove 
irritants is called reward power. To apply this type of power, managers 
must have control over financial and nonfinancial incentives that can be 
used to entice others to adopt plans. Legitimate power is based on sponsors’ 
rights to take action in organizations. Expert power flows from a track record 
of successes and a reputation for candid and honest dealings. Expert power 
requires both credibility and trust, but informational power, in which manag- 
ers offer their insights or cite anecdotes, requires only that they have the 
trust of other organizational members. Referent power stems from the attrac- 
tions people have for others that lead them to label these individuals as 
charismatic. Sponsors drew on one or more of these power bases when 
carrying out edicts (Nutt, 1983). For example, legitimate power was applied 
when sponsors used legitimate prerogatives to make changes. When carrying 
out edicts, sponsors trade social credit for action and risk failure if their 
power proves to be insufficient. However, the frequent use of power can 
strain organizations and gradually drain manager-sponsors' stores of social 
credit. : 
Contingency use of tactics. Research is needed to determine if contin- 
gency use of implementation tactics is desirable. All forms of implementa- 
tion were found to have some degree of success. Because some ways to 
manage planned change processes are more costly and less timely than others, 
future research could develop and test conditions in which each implementa- 
tion tactic can be effective, creating a contingency model. 


Tactic Effectiveness and Conditions of Use 


The study identified a variety of interventionist tactics such as challeng- 
ing existing norms, offering viable ideas, or both, and found all to produce 
100 percent success rates. The most common participation tactic—represen- 
tatives providing solutions—implicitly used persuasion-like tactics and had 
success rate$ of about 75 percent, similar to those for persuasion tactics. 
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Edicts fared poorly, failing more often than they succeeded, whether changes 
had personal significance for sponsors, organizational significance, or both. 
The only participation variation that employed full representation of 
affected parties occurred infrequently, perhaps because complete participa- 
tion demanded process budgets far above what other changes required. Gen- 
erating solutions with delegated participation required modest process 
budgets, but was less effective than complete participation, seemingly because 
representatives had to use persuasion to sell changes they devised. Managers 
_underestimated needs for staff support in delegated participation. Apparently, 
sponsors allocate too little staff support when they enlarge the role of task 
forces to include solution development. Token participation, framing with 
representatives, had marginal staff support and small process budgets, sug- 
gesting pro forma attempts at cooptation and efforts to minimize implementa- 
tion costs. 

Edicts motivated by sponsors’ interests had small budgets, apparently to 
avoid drawing undue attention to changes and sponsors’ vested interests. 
When an edict was used for changes with organizational significance, manag- 
ers provided only marginal staff swpport, but respondents termed staff sup- 
port satisfactory for changes of some significance to sponsors. Sponsors appear 
to reserve the best staff for their pet change attempts. 

Success was guaranteed when sponsor-managers were actively involved. 
in applying interventionist tactics. The 100 percent success rates for inter- 
vention and its modest resource demands make this implementation tactic, 
highly desirable and worthy of active promotion among managers. Persua- 
sion was successful in three-quarters of the cases in which managers used it. 
This tactic demanded strong staff support and seemed to require high pro- 
cess budgets compared to intervention. Surprisingly, implementation by per- 
suasion was not limited to changes that sponsors considered of little impor- 
tance and under low time pressure. Managers did not deem only throw-away 
changes appropriate for experts to manage, as some theorists have implied 
(e.g., March, 1981). Participation was found to be no more effective than 
persuasion, and the two required comparable staff and process budget support. 

All of the tactics were used under varying levels of time pressure and 
perceived importance and for each of the change types. This suggests that 
time pressure, importance, and change type do not influence managers’ selec- 
tions of implementation tactics. 

Future researchers should be wary of pragmatic measures, such as the 
one used in this research, rate of adoption. The PBX case demonstrates the 
reason for this caveat. What was the origin of the grievances? At the conclu- 
sion of the change process no one had any idea. The sponsor saw the symp- 
toms disappear and so judged the change a success. But if the origin of the 
grievances were an autocratic PBX supervisor, the grievances might recur. 
What appeared to be a pragmatic change may have merely papered over 
issues that could emerge again in problems seemingly unrelated to the ear- 
lier ones. Implementation has pragmatic importance, but the key step in 
planned change processes may be formulation. Future research should devise 
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ways to measure the likelihood and consequences of incorrect stipulation of 
needs or opportunities by managers. 
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SOCIALIZATION TACTICS, SELF-EFFICACY, 
AND NEWCOMERS’ ADJUSTMENTS 
TO ORGANIZATIONS 


GARETH R. JONES 
Texas A & M University 


This study investigated the relationship between the socialization tac- 
tics employed by organizations and a series of role and personal 
outcomes. It also examined the effects of self-efficacy on role orientation. 
Consistent with predictions, results suggest that different patterns of 
socialization lead to different forms of newcomer adjustment to organi- 
zations. Specifically, institutionalized tactics lead to custodial role 
orientations, and individualized tactics to innovative role orientations. 
The results also suggest that self-efficacy moderates this learning pro- 
cess—specifically that socialization tactics produce a stronger custodial 
role orientation when newcomers possess low levels of self-efficacy. 


Although there is evidence to suggest that socialization experiences affect 
personal and role outcomes, few empirical studies of socialization processes 
have explicitly addressed the question of how specific socialization tactics 
affect newcomers' adjustments to organizations. Rather, the focus has been 
on how newcomers' experiences of organizational reality at different stages 
in socialization processes result in such outcomes as satisfaction and feelings 
of personal worth (Feldman, 1976), commitment (Buchanan, 1974), or lon- 
gevity in organizations (Katz, 1978). Only untested theories have explicitly 
addressed the way in which socialization tactics affect newcomers’ responses. 
A second and related issue is the possibility that individual differences may 
also affect newcomers' adjustments to organizations and may furthermore 
moderate the effects of socialization tactics on individuals' orientations to 
roles (Jones, 1983a). From such an interactionist perspective, it is necessary 
to investigate how both individual and organizational factors combine to influ- 
.ence (1) newcomers' subsequent adjustments to organizations and (2) their 
role behaviors. i 

The purpose of this study was to address these issues. It first examined 
how the information provided by organizations through their socialization 
practices may influence the way newcomers adjust to the organizations. 
Second, it examined the proposition that newcomers’ levels of self-efficacy 
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Ferris, and Cynthia Fisher for their helpful comments on earlier drafts of this manuscript. I also 
wish to thank Jane Nathan and the College of Commerce and Business Administration at the 
University of Illinois for providing resources to conduct the study. An earlier version of the 
manuscript was presented at the Western Academy of Management Meetings, March 1984. 
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may moderate the effects of these socialization practices on their subsequent 
role orientations. Role orientation is defined here as the manner in which 
individuals perform their roles and adjust to task requirements. 


TACTICS AND OUTCOMES 


Newcomers entering organizations may experience a reality shock 
(Hughes, 1958) or surprise (Louis, 1980) when their assumptions about how 
people interpret and respond to actions or events do not conform with those 
that prevail in their new contexts. As a result, newcomers may be forced to 
reevaluate their assumptions about these organizations and to seek informa- 
tion about why people behave as they do, if they are to reduce the uncer- 
tainty or anxiety surrounding the entry process (Jones, 1983b; Van Maanen, 
1977): One of the main ways in which organizations can influence this 
learning process is by using specific tactics of socialization. 

Van Maanen and Schein (1979) offered a theoretical explanation of how 
methods of socialization influence one particular outcome of socialization, 
role orientation. According to their theory, newcomers respond to their roles 
differently because the socialization tactics used by organizations shape the 
information newcomers receive. By giving or withholding information or by 
providing information in particular ways, organizational incumbents can 
encourage newcomers to interpret and respond to situations in a predictable 
manner. Figure 1 presents the six socialization tactics that Van Maanen and 
Schein (1979: 236—252) hypothesized to be significant in influencing 


FIGURE 1 
A Classification of Socialization Tactics? 


a INSTITUTIONALIZED INDIVIDUALIZED 
Tactics concerned 


mainly with: 
Collective Individual 
CONTEXT 
Formal Informal 
Sequential Random 
CONTENT 
Fixed? Variable 
Serial Disjunctive 
SOCIAL ASPECTS 


Investiture” Divestiture 





a Based on Van Maanen and Schein (1979: 232). 
b Indicates reverse of effects hypothesized by Van Maanen and Schein (1979). 
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newcomers’ responses. However, the present study hypothesized that the 
effects of two of these tactics on newcomer response are the opposite of those 
proposed by Van Maanen and Schein. Also, the overall classificatory scheme 
used in this study is different, as is discussed in the following pages. 

The first two types of tactics vary in terms of the contexts in which 
organizations provide information to newcomers. First, collective versus 
individual socialization tactics are hypothesized to produce divergent role 
orientations. With collective tactics, new recruits go through common learn- 

. ing experiences designed to produce standardized responses to situations. In 
group settings interactions among newcomers reinforce the definition of the 
situation offered by socialization agents. By contrast, individual socializa- 
tion practices give each newcomer a unique set of learning experiences that 
allows heterogeneity in their responses. As a result, collective tactics will 
produce custodial role orientations, in which newcomers accept the status 
quo and passively accept the substantive requirements of tasks or roles (Van 
Maanen & Schein, 1979). Individual tactics, on the other hand, provide 
newcomers with opportunities to develop differentiated responses and to 
adopt innovative orientations toward roles. With such orientations, newcom- 
ers may decide to change the methods for performing their roles or even their 
missions. 

The second categorization of socialization tactics, formal versus informal, 
is closely related to the first in that it also deals with the context in which 
information is presented to newcomers. When organizations employ formal 
tactics, they segregate newcomers from other organizational members while 
they learn the responsibilities of their roles. With informal tactics, newcom- 
ers become part of work groups and learning takes place on the job. As a 
result, formal tactics, especially when coupled with collective practices, 
increase the propensity of newcomers to accept definitions of situations 
offered by significant others in their organizations. In turn, this increases the 
degree to which newcomers will share common norms, values, and attitudes, 
and develop custodial orientations. By contrast, informal and individual 
tactics provide newcomers with great latitude to make differentiated 
responses, and innovative responses are a likely result. 

The next two categorizations of tactics, sequential versus random and 
fixed versus variable, deal with the content of the information given to 
newcomers via socialization. Sequential tactics provide recruits with explicit 
information concerning the sequences of activities or experiences they will 
go through in their organizations. Fixed tactics provide them with precise 
knowledge of the timetables associated with completing each stage in these 
processes. By contrast, variable socialization tactics provide no information 
about when newcomers may reach a certain stage in a learning process and, 
when a process is random, they do not know the sequence of its stages. Thus, 
random and variable practices will increase newcomers’ levels of uncer- 
tainty since they are missing information concerning their organizational 
futures. 


1986 Jones 265 


While Van Maanen and Schein (1979) argued that sequential tactics will 
lead to role conformity, they also argued that variable tactics will lead to 
custodial responses because variability causes anxiety that motivates people 
strongly towards conformity. However, it seems more likely that when the 
future is uncertain, variable tactics will encourage innovative responses, 
especially in organizations in which newcomers’ abilities to deal compe- 
tently with uncertain situations govern upward mobility. For similar reasons, 
fixed socialization tactics may not lead to innovative responses, but to custo- 
dial ones, because individuals will have no desire to rock the boat if they can 
clearly see the pathways to their future statuses from the beginning. Thus, 
the effects of fixed or variable tactics on newcomers’ orientations must be 
determined empirically. 

Van Maanen and Schein’s (1979) last two categorizations of socializa- 
tion tactics are also closely related to each other; they are serial versus 
disjunctive and investiture versus divestiture. In serial processes, experi- 
enced organizational members act as role models for new recruits, but in 
disjunctive processes newcomers must develop their own definitions of situ- 
ations because no other or prior role incumbents are available. For this 
reason, disjunctive processes are likely to result in active, innovative role 
orientations. The final tactical category, investiture versus divestiture, con- 
cerns the degree to which newcomers receive positive or negative social 
support after entry from experienced organizational members. These two 
categorizations therefore reflect social or interpersonal aspects of the 
socialization process. In the social, interpersonal sphere, interpretations 
offered by other organizational members may more strongly influence 
newcomers’ perceptions of contexts than do objective characteristics of 
contexts. Fixed or sequential socialization tactics would furnish such objec- 
tive information. Previous research (Van Maanen & Schein, 1979) suggested 
that divestiture also results in custodial responses because organizations 
explicitly intend such tactics to mold newcomers into forms that the organiza- 
tions wish. By contrast, investiture, which reinforces peoples’ beliefs in 
their own competency, results in innovative role orientations. 

Once again, the logic behind this argument is open to question. By 
disconfirming newcomers’ expectations about themselves through negative 
social experiences, divestiture may not cause newcomers to adopt passive or 
custodial role orientations. Instead, it may encourage them to excel in their 
roles. In other words, divestiture may lead individuals to question defini- 
tions of situations offered by others and stimulate innovative orientations to 
roles. Conversely, confirming newcomers’ definitions of their own compe- 
tency at an early stage in their entry processes may paradoxically cause them 
to believe in self-fulfilling prophecies of their own worth that will lead to 
custodial orientations. Thus it is not clear a priori which socialization tactics 
will result in active and innovative or passive and custodial role orientations. 
Furthermore, it is unclear whether different sets of practices, as opposed to 
different ends of the six continua, will lead to opposite role orientations. Van 
Maanen and Schein (1979) suggested that different combinations of tactics 
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will result in different role orientations. However, given the modifications 
suggested above and the lack of previous empirical work, the guiding hypothe- 
sis for this study is that different ends of the six continua will produce 
opposite role orientations. Specifically, it was expected that socialization 
practices that are individual, informal, variable, random, and disjunctive 
and that involve divestiture will produce innovative role orientations. These 
practices will be collectively referred to as individualized socialization tactics. 
Conversely, it was expected that the opposite ends of these continua, collec- 
tively referred to as institutionalized socialization tactics, lead to passive or 
custodial role orientations. 

Hypothesis 1: Institutionalized socialization tactics will 

produce custodial role orientations and individualized tac- 

tics will produce innovative role orientations. 

Role orientation is only one outcome that the process of socialization 
may influence. Other outcomes of importance concern levels of role ambigu- 
ity and role conflict, job satisfaction, commitment, and intention to quit. 
Taken collectively, these outcomes generally describe newcomers’ personal 
adjustments to their organizations. To the degree that socialization tactics 
provide information that reduces the uncertainty or anxiety surrounding an 
entry process, a predictable pattern of relationships between these variables 
should emerge. Specifically, since institutionalized socialization tactics are 
likely to present newcomers with less problems in searching for situational 
consistency and mediating personal adjustment than individualized tactics, 
it is hypothesized that: 

Hypothesis 2: Institutionalized socialization tactics will 
be negatively related to role conflict, role ambiguity, and 
intention to quit, and positively related to job satisfac- 
tion and commitment. 

Although Hypothesis 2 implies that, in general, institutionalized tactics 
are more likely to influence personal adjustments to organizations than indi- 
vidualized tactics, it is likely that certain tactics will have more effect on 
newcomers than others because different tactics provide information in dif- 
ferent ways. For example, collective or formal tactics are likely to be the least 
important types in mediating adjustment, because both essentially deperson- 
alize newcomers and do not allow for variability in their responses to 
situations. Fixed and sequential tactics are likely to be more important in 
this regard because they provide newcomers with content-specific infor- 
mation concerning their future organizational progress. However, investi- 
ture and serial socialization tactics are likely to be most important, espe- 
cially in situations in which information is ambiguous or role learning is 
complex, because they provide the social cues and facilitation necessary 
during learning processes (Feldman, 1976). This argument led to the follow- 
ing hypothesis: 

Hypothesis 3: Investiture and serial methods will be the 
most important among the six categories of socialization 
tactics in mediating personal adjustments to organizations. 
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Self-Efficacy and Role Orientation 


The introduction to this paper argued that newcomers’ levels of self- 
efficacy may moderate the effects of socialization tactics on role orientation. 
Self-efficacy influences people’s expectations about their abilities to perform 
successfully in new situations. Expectations about self-efficacy are directly 
related to people’s perceptions of their success in dealing with past situa- 
tions and to their expectations about future success. Thus, the knowledge 
newcomers have acquired through past experience and the strategies they 
have developed in the past to deal with ambiguity or uncertainty may also 
influence how they select information and respond to new situations. 

For example, people with high levels of self-efficacy might be expected 
to define their new roles differently than those with low levels because of 
differing beliefs about their personal competency. As a result, newcomers 
with high levels of self-efficacy may take proactive stances toward role per- 
formance in order to demonstrate their abilities; alternatively, the sociali- 
zation tactics used may not affect their subsequent role orientations. These 
newcomers will interpret situations as they see fit. Conversely, individuals 
with low levels of self-efficacy may more readily accept definitions of situa- 
tions offered by others. In general, it may be argued that when organizations 
employ institutionalized methods of socialization, custodial role orienta- 
tions are more likely to result among individuals possessing low rather than 
high levels of self-efficacy. This leads to the following hypothesis: 

Hypothesis 4: A newcomer’s level of self-efficacy will mod- 
erate the effects of institutionalized tactics on role orien- 
tation. 

In sum, the purposes of this study were to investigate (1) how socializa- 
tion tactics affect role orientations and personal adjustments to organizations, 
and (2) whether level of self-efficacy moderates the effects of socialization 
tactics on role orientations. 


METHODS 
Respondents and Procedures 


In order to investigate the effects of both individual and organizational 
variables on newcomers’ adjustments to organizations, this study employed 
a longitudinal research design. Respondents, all M. B. A. students from two 
successive annual graduating classes of a major midwestern university, com- 
pleted two questionnaires. They completed the first after they had accepted 
jobs but before they joined their organizations; they completed the second 
approximately five months after joining. This time period was chosen to 
capture newcomers’ responses after they had had time to experience social- 
ization processes and develop role orientations. Respondents received ques- 
tionnaires to be completed before joining organizations at the time of their 
exit interviews. They were requested to mail their responses to the researchers. 
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The second questionnaire was mailed to respondents and follow-up letters 
were sent one month later. 
The questionnaire completed before entry measured levels of self-efficacy. 
- The second questionnaire measured the six socialization tactics and the six 
outcome variables: role orientation, role conflict, role ambiguity, job 
satisfaction, intention to quit, and commitment. The first questionnaire was 
completed by 127 respondents from a population of 282, giving a 45 percent 
response rate; 102 of the original 127 (an 80 percent response rate) com- 
pleted the second questionnaire. Thus, overall 36 percent of the original 
population responded. In order to investigate whether the characteristics of 
the final group differed from those of the original one, the study compared 
the original group’s M. B. A. concentrations, as expressed in their exit 
interviews, and the final group’s functional specializations in their organi- 
zations, as expressed in the second questionnaire. In general, the propor- 
tions of respondents in various business functions like finance, marketing, 
and so forth, were similar to the proportions of respondents in the various 
M. B. A. concentrations. This provides some support for the representative- 
ness of the final group. Respondents had joined 96 widely diverse firms 
located mainly in the Midwest and Southwest: major accounting firms, banks, 
manufacturing and service companies, as well as small, specialized organi- 
zations. There were 73 men and 29 women in the final group. The average 
age of respondents was 24.7 years. 


Measures 


Self-efficacy. Following Bandura (1977, 1978), self-efficacy was mea- 
sured in terms of the people’s expectations that “they can successfully exe- 
cute the behavior required to produce the outcome" (Bandura, 1977: 192], in 
this case, mastery of role and organizational requirements. The self-efficacy 
scale consisted of eight items scored on 7-point Likert-type scales ranging 
from “strongly agree” to “strongly disagree" (a = 71). The Appendix lists the 
items in this scale. 

Socialization tactics. The six socialization tactics were operationally 
defined in terms of Van Maanen and Schein's (1979) discussion. Care was 
taken to ensure that each facet of each tactic figured as an item in each scale. 
Also, care was taken to endow questions with an active, behavioral, rather 
than an affective, evaluative tone in order to reduce common method variance. 
The Appendix lists the six 5-item scales that resulted from this procedure. 
Table 2 gives the reliabilities of these scales and all others as its diagonal 
elements. Although the different ends of the scales bear different names in 
Table 2 for purposes of clarity (e.g., fixed versus variable), the scales are not 
bipolar, but are based on a Likert-type format. 

To further investigate the relationships between tactics, varimax factor 
analysis was performed. Table 1 gives the results of this procedure. Both of 
the social tactics load on Factor 1 and both of the content tactics load on 
Factor 2. However, formal versus informal and collective versus individual 
tactics load largely on different dimensions, indicating that not all formal 
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TABLE 1 ' 
Rotated Factor Loadings from Factor Analysis 
for the Socialization Tactic Scales 





Factors ? 
Items ^ 1 2 3 4 





Investiture vs. divestiture 
ID1 č -57 
ID2 74 
ID3 ; e OB 
ID4 .62 
ID5 76 
Serial vs. disjunctive 
SD1 i AL 
SD2 .67 
SD3 .63 
SD4 46 
SD5 44 
Serial vs. random 
SR1 61 
SR2 42 
SR3 43 
SR4 46 
SR5 73 
Fixed vs. variable 
FV1 .62 
FV2 . 55 
FV3 Bed 45 
FV4 .56 
FV5 55 
Collective vs. individual 
Ch 75 
CI2 .45 
CI3 .57 
Cl4 62° 
CIS .67 
Formal vs. informal 
Fin .48 .48 
FI2 
FI3 i .67 
Pia :53 
FI5 .46 





a Only loadings above .40 are presented. 
? For items identified by abbreviations, refer to Appendix. 


socialization programs were collective, nor all informal programs individual 
(Van Maanen & Schein, 1979). These results provide some support for the 
classificatory scheme in Figure 1. . 

Outcome measures. Role orientation was measured on a 5-item Likert- 
type scale using a 1-7 response. format ranging from "strongly disagree" to 
"strongly agree." A high score on orientation indicates an innovative 
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orientation, in which a newcomer attempts to alter procedures for perform- 
ing a role, the purpose of the role itself, or both. A low score indicates a 
custodial orientation, in which a newcomer accepts the prescribed limits of 
a role in an organization. Role conflict and ambiguity were measured with 
the scales developed by Rizzo, House, and Lirtzman (1970). 

In order to ensure that role orientation was conceptually and empirically 
independent of role conflict and ambiguity, a factor analysis of the 20 items 
composing these scales was performed. Three factors with eigenvalues greater 
than 1 were obtained; these accounted for 93.6 percent of the common 
variance. Examination of the item loadings using a .40 cutoff revealed that 
Factor 1 contained only those items referring to role ambiguity; Factor 2 only 
role conflict items; and Factor 3 only role orientation items. This supported 
the analytical separation of these variables. 

For the personal outcome variables, commitment was measured using a 
shortened version of the scale by Porter, Steers, Mowday, and Boulian (1974). 
(See the Appendix.) Satisfaction was measured on the 7-point Faces scale 
(Dunham & Herman, 1975; Kunin, 1955) as a summary measure of overall job 
satisfaction in order to reduce method variance. Intention to quit was mea- 
sured by a 2-item scale: “I will probably look for a new job in the coming 
year,” and “I scan the newspapers and other sources for prospective jobs.” 


RESULTS 


Intercorrelations among all variables appear in Table 2. Examining the 
intercorrelations between the scales measuring socialization tactics reveals a 
high degree of intercorrelation between five of the six scales. Investiture- 
divestiture has the smallest degree of intercorrelation with the other tactics 
(r = .28). However, in all cases intrascale reliability estimates (a) are of 
greater magnitude than interscale Pearson correlations. This suggests at least 
a tolerable level of discriminant validity between tactics (Buchanan, 1974). 
Moreover, as discussed in earlier sections of this paper, theory suggests that 
the use of these tactics will be highly correlated, and the pattern of correla- 
tion between the tactics supports this hypothesis. For example, investiture 
correlates most highly with serial socialization tactics, and sequential tactics 
correlate most highly with fixed tactics. 

In regard to the issue of the relationship between socialization tactics 
and role orientation, it appears that an innovative role orientation is signifi- 
cantly and negatively related to each of the institutional methods of 
socialization. On the other hand, individualized socialization programs were 
associated with relatively high levels of role conflict and ambiguity plus 
innovative role orientations. This implies that high levels of role conflict and 
ambiguity are associated with performing roles in innovative ways. Either 
newcomers self-generate dissonance through how they perform their roles, 
or their actions lead others to question how they perform. For example, 
organizational incumbents’ resisting newcomers’ attempts to redefine their 
toles will raise the level of role conflict and ambiguity that the newcomers 
experience. Newcomers will be forced to trade off their desires to perform 
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their roles innovatively against desires to reduce uncertainty by lowering 
levels of role conflict and ambiguity. Conversely, custodial role orientations 
are associated with low levels of role conflict and ambiguity, as might be 
expected. This issue is explored further later in the Results section. 

The relationship between socialization tactics and personal outcomes 
also shows a consistent pattern of results. The more institutionalized the 
form of socialization was, the greater were expressed job satisfaction and 
commitment, and the lower was intention to quit. Thus, there appears to be 
considerable support for the hypothesized relationship between institution- 
alized tactics and reduction of anxiety. 

Further analysis was conducted in order to investigate the relationship 
between personal and role outcomes and the six socialization tactics, and 
also to discover which tactics have most impact upon these outcomes. Since 
there are multiple dependent and multiple independent variables, canonical 
correlation analysis was used. This technique was appropriate because its 
aim is to develop parsimonious linear combinations of dependent and 
independent variables that maximize correlations between each set of linear 
combinations. Table 3 presents ihe results of the canonical analysis. Here, 
the structure coefficients are the beta weights used to form the respective 
dependent and independent linear combinations. In other words, applying 


TABLE 3 
Results of Canonical Analysis of the Relationships of Socialization 
Tactics with Personal and Role Outcomes 





Canonical Wilks’s Canonical Coefficients of 
Roots Eigenvalues Lambda F Correlations Redundancy 

1 2.52 18 5.27** -B5 BN 

2 0.28 .63 1.74* 47 .64 

Canonical Set I Canonical Set H 

Structure Standardized Structure  . Standardized 
Variables Coefficients Coefficients Coefficients Coefficients 
Dependent variables 
Role orientation ; -.56 -.19 67 .92 
Role conflict -.76 -.08 -.20 -.59 
Role ambiguity -77 -.39 ER 31 
Commitment .78 .25 .34 .37 
Job satisfaction Dë 44 .28 11 
Intention to quit -.51 11 .05 .18 
Independent variables 
Collective vs. individual .26 -41 -.77 -.91 
Formal vs. informal .33 .18 -.32 71 
Fixed vs. variable .65 -.05 -.50 -.45 
Sequential vs. random .69 43 -.53 -.05 
Investiture vs. divestiture .85 .55 .20 .69 
Serial vs. disjunctive .82 .38 -.43 -.30 
* p « .05 


** p « .001 
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these standardized weights to the respective variables produces the canoni- 
cal variables. The standardized coefficients are the respective zero-order 
correlations of the variables with the linear combinations De, canonical 
variables). Because previous research (Cooley & Lohnes, 1971) has shown 
that the structure coefficients have the greatest stability, they were used for 
interpretive purposes. 

The results of the canonical analysis give two significant canonical 
functions, with the first accounting for most of the variance. (The coefficient 
of redundancy is the multivariate analog of r°.) It is immediately obvious that 
the structure coefficients for both independent variable canonical functions 
are almost mirror images of one another. With the exception of investiture 
versus divestiture, whose loading falls from .85 to .20, the first canonical 
function represents an institutionalized pattern of socialization tactics, while 
the second canonical function represents an individualized pattern. Relating 
these results to the structure coefficients of the dependent variables reveals 
that for the first canonical function, high job satisfaction and commitment, 
and low role conflict and ambiguity load the heaviest. For the independent 
variables, investiture, serial, sequential, and fixed tactics load highest. The 
results support Hypothesis 2, stating that institutionalized tactics will be 
most significant in mediating personal adjustments to organizations. On the 
second canonical function, of the dependent variables only role orientation 
loads high (0.67), and individual, random, variable, and disjunctive social- 
ization tactics, in that order, load highest. These results suggest that an 
innovative role orientation is the main outcome of individualized practices, 
or conversely, that institutionalized tactics are the tactics most likely to pro- 
duce custodial role orientations. Thus, taken together, these results support 
Hypotheses 1 and 2 and suggest that institutional and individualized social- 
ization tactics have different effects on role orientation and personal 
adjustment. 

Hypothesis 3 suggested that investiture and serial tactics would be most 
important in mediating personal adjustments to organizations. The loadings 
of the six tactics on canonical function 1 support this proposition. Investi- 
ture and seríal tactics load highest, followed by fixed and sequential tactics, 
with collective and formal tactics loading lowest. These results appear to 
suggest that social rather than context types of institutional socialization 
tactics will have more effect on newcomers' transitions into organizations. 

The next issue was the effect of level of self-efficacy on role orientation. 
The Pearson correlation coefficients in Table 2 suggest that role orientation 
is positively related to self-efficacy. In order to test the hypothesis that 
self-efficacy moderates the effects of socialization tactics on role orientation, 
a series of moderated regressions were performed. A separate regression 
equation was run for each socialization tactic with role orientation as the 
dependent variable. In each equation, self-efficacy and an interaction term 
composed of the product of the socialization tactic and self-efficacy were 
included. The interaction term was significant (p « .05) for three of the six 
socialization tactics. To conserve space, this paper does not show the results 
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of these regressions. Instead, a subgroup correlation analysis that provides 
similar information was subsequently performed in accordance with Peters 
and Champoux's (1979) recommendation. A median split on self-efficacy 
scores was performed and separate correlations were then computed for the 
relationship between role orientation and each socialization tactic. Table 4 
presents correlations and subgroup differences. These results suggest that 
institutionalized socialization tactics result in considerably stronger custo- 
dial orientations when individuals possess low rather than high levels of 
self-efficacy. It appears that individuals low in self-efficacy more readily 
conform to definitions of situations offered by others. In other words, level of 
self-efficacy moderates the effects of socialization tactics on role orientation. 
These results provide some support for an interactionist perspective. In order 
to understand the nature of the newcomers' adjustments to organizations, it 
is necessary to consider the combined effects of individual and organiza- 
tional factors. 


DISCUSSION AND CONCLUSIONS 


Empirical investigation of the way in which socialization tactics affect 
personal and role outcomes has lagged behind the theoretical conceptualiza- 
tion of these issues (Feldman, 1976). In recent work, both Van Maanen and 
Schein (1979) and Louis (1980) suggested that reducing uncertainty is the 
major goal of newcomers to organizations and that the methods of socializa- 
tion that organizations use can influence the way newcomers respond to 
their organizations. The findings of this study support this hypothesis, and 
the canonical results reported here reveal a pattern of relationships between 
tactics and outcomes supporting the proposition that different socialization 
tactics lead to different outcomes of socialization. The social dimensions of 


TABLE 4 
Moderating Influence of Self-Efficacy on Correlations 
between Role Orientation and Socialization Tactics 





Correlations with Role Orientation 
Significance of 


Socialization Tactics Low Self-Efficacy High Self-Efficacy Difference? 
Gollective vs. individual -4n*** -.25* 10 
Formal vs. informal ^42*** -.12 .06 
Sequential vs. random -.57*** -4g*** n.s. 
Fixed vs. variable i ~.63*** -.22 : 01 
Investiture vs. divestiture -.48*** -.09 .05 
Serial vs. disjunctive -.68*** -.33** 05 


3 The significance of the difference between subgroup correlations was tested applying Fisher's 
r to z transformation and examining the differences using a Z test. 
*p«.05 
** pe Di 
*** p < 001 
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socialization—investiture and serial processes—-appear to be particularly sig- 
nificant in influencing role orientations and subsequent adjustments to 
organizations. However, the content of the information given to newcomers 
concerning role requirements and their future organizational progress also 
appears to reduce the uncertainty surrounding the entry process. Both collec- 
tive and formal socialization tactics were relatively less influential, perhaps 
because they define newcomers as learners and increase the psychological 
distance between newcomers and organizations. The importance the litera- 
ture has attributed to these variables may be a consequence of authors’ using 
highly unusual or extreme settings, such as military socialization programs, 
as illustrations. In such settings recruits have no other choice but to respond 
to organizational commands. Very few organizations have such captive audi- 
ences to experiment on. 

These results also suggest that the different ends of the six continua of 
socialization tactics, rather than a mix of tactics, are instrumental in leading 
to particular role orientations. Moreover, contrary to Van Maanen and Schein’s 
(1979) argument, both fixed and investiture tactics, rather than variable and 
divestiture tactics, are associated with custodial responses. This pattern sup- 
ports this study’s hypothesis and suggests that two factors combine to dis- 
courage newcomers from performing their roles in ways that are contrary to 
those customary in an organization. These discouraging factors are ability to 
predict progress in an organization and definitions of situations offered by 
others. The finding that role orientation is positively correlated with role 
conflict and ambiguity further supports this interpretation; failure to accept 
significant others’ definitions of situations results in increasing uncertainty 
or anxiety for newcomers. 

As argued in an earlier section, newcomers are likely to have proclivities 
to interpret situations in certain ways and to respond to them in their own 
terms. This study found that newcomers’ levels of self-efficacy moderated 
the effects of institutionalized tactics on role orientation, in support of this 
hypothesis. This finding suggests that newcomers use self, as well as others, 
as a locus of control when evaluating situations (Weiss, 1978). The results 
suggest that newcomers high in self-efficacy will tend to define situations 
themselves even when their roles or progressions in organizations are 
prescribed. 

However, caution should be exercised in interpreting these results 
because this study relied exclusively on self-reported measures. Studies 
employing only self-reported data have raised concern among organizational 
researchers, because common method variance may affect the magnitude of 
relationships between work characteristics and work outcomes. The use of a 
longitudinal research design partially obviated this problem so far as the 
moderating effects of self-efficacy are concerned because self-efficacy mea- 
sured at time 1 was analyzed with tactics and outcomes measured at time 2. 
Moreover, as discussed in the Methods section, care was taken to operation- 
ally define the measures of socialization tactics in as objective a way as 
possible, and the relationships between the outcome measures are consistent 
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with those found in many studies. Further, all hypotheses were directional 
and all were supported as stated. Nevertheless, it would have been useful to 
have had independent sources like supervisors confirm the nature of social- 
ization tactics. The original research design attempted to get such confirma- 
tion, but respondents’ anxieties about anonymity and feedback to supervisors 
-prevented its use. The procedure in effect increased newcomer anxiety. 
Finally, all methodologies have costs and benefits. Although common method 
_variance may be a cost of survey research, this study gained one main benefit 
from its use. This benefit was the opportunity to examine socialization prac- 
tices across widely diverse organizations, which provided the variability in 
responses necessary for meaningful analysis. 

In conclusion, it is suggested that future research be conducted across 
different firms that hire similar candidates but treat them differently. These 
settings should produce different results that will allow further investigation 
of the effects of socialization tactics on newcomers’ adjustments. It would be 
interesting to study organizations’ socialization practices in conjunction with 
newcomers’ tasks. There is likely to be some correspondence between the 
demands of tasks and socialization practices; this correspondence would 
resemble in kind the relationships between choice of leader behaviors and 
task constraints that path-goal theories of leadership have posited (House, 
1971). Such research may make it possible to fathom why firms use certain 
kinds of tactics rather than others. It- would also be interesting to investigate 
how organizations’ sizes or natures influence their choices of socialization 
tactics. It is likely that research that considers the requirements of different 
jobs as to skills and roles will find factors based on individual differences 
even more salient than did this study. Finally, also deserving of investiga- 
tion is the degree to which newcomers may, because of individual differences, 
self-select organizations because they know what kinds of socialization and 
training the organizations use. This issue is likely to be particularly impor- 
tant in the case of newcomers who join unusual socialization programs such 
as the military’s. The measures of socialization and self-efficacy developed 
in this study may, with suitable modification, prove useful in such future 
research. 


REFERENCES 
Bandura, A. 1977. Self-efficacy: Towards a unifying theory of behavioral change. Psychological 
Heview, 84: 191—215. 


Bandura, A. 1978. The self-system in reciprocal determination. American Psychologist, 33: 
344—358. 


Buchanan, B. 1974. Building organizational commitment: The socialization of managers in 
work organizations. Administrative Science Quarterly, 19: 533—546. 


Cooley, W., & Lohnes, P. R. 1971. Multivariate data analysis. New York: john Wiley & Sons. 


Dunham, R., & Herman, J. 1975. Development of female Faces scale for measuring job satisfaction. 
Journal of Applied Psychology, 60: 629—631. 


1986 Jones 277 


Feldman, D. C. 1976. A contingency theory of socialization. Administrative Science Quarterly, 
21: 433—452. 


House, R. J. 1971. A path-goal theory of leader effectiveness. Administrative Science Quar- 
terly, 16: 321—339. 


Hughes, E. C. 1958. Men and their work. Glencoe, Ill.: Free Press. 


Jones, G. R. 1983a. Psychological orientation and the process of organizational socialization: An 
interactionist perspective. Academy of Management Review, 8: 464—474. 


Jones, G. R. 1983b. Organizational socialization as information processing activity: A life his- 
tory analysis. Human Organization, 42: 314—320. 


Katz, R. 1978. Job longevity as a situational factor in job satisfaction. Administrative Science 
Quarterly, 23: 205—223. 


Kunin, T. 1955. The construction of a new type of attitude measure. Personnel Psychology, 8: 
65—78. FR 


Louis, M. R. 1980. Surprise and sense-making: What newcomers experience in entering unfamiliar 
organizational settings. Administrative Science Quarterly, 25: 226—251. 


Peters, W. S., & Champoux, J. E. 1979. The use of moderated regression in job redesign decisions. 
Decision Sciences, 10: 85—95. 


. Porter, L., Steers, R, M., Mowday, R. T., & Boulian, P. V. 1974. Organizational commitment, job 
satisfaction, and turnover amongst psychiatric technicians. Journal of Applied Psychol- 
ogy, 59: 603—609. 


Rizzo, J. R., House, J. H. & Lirtzman, S.I. 1970. Role conflict and ambiguity in complex 
organizations. Administrative Science Quarterly, 15: 150—163. 


Schein, E.H. 1968. Organizational socialization and the profession of management. Industrial 
Management Heview, 2: 59—77. 


Van Maanen, J. 1977. Experiencing organizations. In J. Van Maanen (Ed.), Organizational 
careers: Some new perspectives: 15—45. New York: John Wiley & Sons. 


Van Maanen, J., & Schein, E. H. 1979. Towards a theory of organizational socialization. In B. M. 
Staw (Ed.), Research in Organizational Behavior, vol. 1: 209—264. Greenwich, Conn.: JAI 
Press. 


Weiss, H. 1978. Social learning of work values in organizations. Journal of Applied Psychol- 
ogy, 63: 711—718. 


APPENDIX 


Unless otherwise noted, responses were measured on 7-point scales ranging from "strongly 
disagree" to "strongly agree." (R) indicates reverse scoring. 


Scales Measuring Socialization Tactics 
Collective versus individual: 


CH In the last six months, I have been extensively involved with other new recruits in 
common, job related training activities. 

CI2 Other newcomers have been instrumental in helping me to understand my job require- 
ments. 

CI3 This organization puts all newcomers through the same set of learning experiences. 

Ci4 Most of my training has been carried out apart from other newcomers. (R) 

CI5 There is a sense of “being in the same boat" amongst newcomers in this organization. 
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Formal versus informal: 


Fn 


FI2 


FI3 


FI4 
FI5 


I have been through a set of training experiences which are specifically designed to give 
newcomers a thorough knowledge of job related skills. 

During my training for this job I was normally physically apart from regular organiza- 
tional members. 

I did not perform any of my normal job responsibilities until I was thoroughly familiar 
with departmental procedures and work methods. 

Much of my job knowledge has been acquired informally on a trial and error basis, (R) 
I have been very aware that I am seen as “learning the ropes” in this organization. 


Investiture versus divestiture: 


D 


ID2 
ID3 
ID4 
IDS 


I have been made to fee] that my skills and abilities are very important in this organi- 
zation. 

Almost all of my colleagues have been supportive of me personally. 

I have had to change my attitudes and values to be accepted in this organization. (R) 
My colleagues have gone out of their way to help me adjust to this organization. 

I feel that experienced organizational members have held me at a distance until I 
conform to their expectations. (R) 


Sequential versus random: 


SR1 


SR2 


SR3 


SR4 


SR5 


There is a clear pattern in the way one role leads to another or one job assignment leads 
to another in this organization. 

Each stage of the training process has, and will, expand and build upon the job know- 
ledge gained during the preceeding stages of the process. 

The movement from role to role and function to function to build up experience and a 
track record is very apparent in this organization. 


. This organization does not put newcomers through an identifiable sequence of learning 


experiences. (R) 
The steps in the career ladder are clearly specified in this organization. 


Serial versus disjunctive: 


SD1 


SD2 


SD3 


SD4 


SIS 


Experienced organizational members see advising or training newcomers as one of 
their main job responsibilities in this organization. 

I am gaining a clear understanding of my role in this organization from observing 
my senior colleagues. i 

I have received little guidance from experienced organizational members as to how 
I should perform my job. {R} 

I have little or no access to people who have previously performed my role in this organ- 
ization. (R) 

I have been generally left alone to discover what my role should be in this organi- 
zation. (R) 


Fixed versus variable: 


FV1 


I can predict my future career path in this organization by observing other people’s 
experiences. 

I have a good knowledge of the time it will take me to go through the various stages 
of the training process in this organization. 

The way in which my progress through this organization will follow a fixed timetable 
of events has been clearly communicated to me. 

I have little idea when to expect a new job assignment or training exercise in this organi- 
zation. (R) 
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FV5 Most of my knowledge of what may happen to me in the future comes informally, 
through the grapevine, rather than through regular organizational channels. (R) 


Scale Measuring Self-Efficacy 


My new job is well within the scope of my abilities. 

I do not anticipate any problems in adjusting to work in this organization. 

I feel I am overqualified for the job I will be doing. 

I have all the technical knowledge I need to deal with my new job, all I need now is practical 
experience. i 

I feel confident that my skills and abilities equal or exceed those of my future colleagues. 

My past experiences and accomplishments increase my confidence that I will be able to per- 
form successfully in this organization. 

I could have handled a more challenging job than the one I will be doing. 

Professionally speaking, my new job exactly satisfies my expectations of myself. (R) 


Scales Measuring Outcomes 
Role orientation: 


Responses were measured on a 7-point scale ranging from “very false" to “very true.” " 
I have made an attempt to redefine my role and change what I am required to do. 
While I am satisfied with my overall job responsibilities, I have altered the procedures for 
doing by job. 
I have changed the mission or purpose of my role. 
The procedures for performing my job are generally appropriate in my view. (R) 
I have tried to change the procedures for doing my job and to institute new work goals. 


Commitment to organization: 


I talk up this organization to my friends as a great organization to work for. 

I feel very little loyalty to this organization. {R} 

I would accept almost any type of job assignment in order to keep working for this organization. 

I am proud to tell others that I am a part of this organization. 

I could just as well be working for a different organization as long as the type of work was 
similar. (R) 

It would take very little change in my present circumstances to cause me to leave'this organization. 
(R) . 

Iam extremely glad I chose this organization to work for over others I was considering at the 
time I joined. 

There is not much to be gained by sticking with this organization indefinitely. (R) 

For me, this is the best of all possible organizations for which to work. 

Deciding to work for this organization was a definite mistake on my part. (R) 


All other items and response formats used were identical to those of the established scales 
referred to in the Methods section. 
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OUTCOMES OF AUTONOMOUS WORKGROUPS: 
A LONG-TERM FIELD EXPERIMENT 
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A quasi-experimental design was used to study the long-term effects of 
implementation of autonomous workgroups in a manufacturing envi- 
ronment, Results indicated a substantial and lasting effect on employees’ 
intrinsic job satisfaction, a more temporary effect on extrinsic job 
satisfaction, and no consequences for work motivation or performance. 
Atan organizational level, however, improvements in productivity were 
made possible through elimination of supervisory positions; and con- 
trary to prediction, labor turnover increased. 


Autonomous workgroups represent a form of work design that has evolved 
from the general sociotechnical systems approach to organizational analysis 
and design (Emery, 1959; Katz & Kahn, 1978; Susman, 1976). Although 
comparisons are difficult, it seems that more organizations have implemented 
work designs with autonomous workgroups on their shopfloors than have 
implemented job characteristics approaches oriented to individuals. Certainly, 
many observers have commented on the increased use of autonomous 
workgroups in a number of industrialized countries (Cummings, 1978; 
Pasmore, Francis, Haldeman, & Shani, 1983), Paradoxically, empirical evi- 
dence concerning the effects of such groups on employee attitudes and behav- 
ior has not shown a corresponding advance. The present study, a large scale, 
long-term, outcome evaluation of autonomous workgroups based on a quasi- 
experimental research design, helps to remedy this deficiency. By way of 
introduction we outline the nature of autonomous workgroups, consider 
their theoretically predicted effects on employee attitudes and behavior, and 
discuss the need for the kind of empirical investigation described in this 
paper. 


THEORETICAL BACKGROUND AND RESEARCH AIMS 
The Nature of Autonomous Workgroups 


The key feature of autonomous workgroups is a high degree of self- 
determination by employees in the management of their day-to-day work. 
Typically this includes collective control over the pace of work, distribution 
of tasks, organization of breaks, and collective participation in the recruit- 
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ment and training of new members (Gulowsen, 1972). Often direct supervi- 
sion is unnecessary (Emery, 1980). To allow for such self-management, the 
organization of work commonly has these three characteristics: (1) employ- 
ees with functionally interrelated tasks who collectively are responsible for 
end products; (2) individuals who have a variety of skills so they may under- 
take all or a large proportion of groups’ tasks; and (3) feedback and evalua- 
tion in terms of the performance of whole groups. 

It will not go unnoticed that the properties prescribed for autonomous 
workgroups parallel the hallmarks of the job characteristics approach to job 
design (Hackman & Lawler, 1971; Hackman & Oldham, 1976). These are 
variety, task identity, autonomy, and feedback. On this basis, Rousseau (1977) 
argued for a synthesis of the two approaches, as did Cummings (1978) and 
Denison (1982). Hackman (1977) went further and suggested that autono- 
mous group working is the more powerful of the two approaches because 
groups can undertake much larger pieces of work than individuals can and 
so allow more fundamental manipulation of work characteristics. 

Thus, previous work has specified the core characteristics of autono- 
mous workgroups relatively clearly, tying them not only to the sociotechnical 
literature, but also to job design theory in general. The main distinction 
between autonomous workgroups and the job characteristics approach con- 
cerns level of application and analysis, not content. For autonomous groups, 
job characteristics are a property of group work, not of individual jobs. 
However, this difference is not apparent in the predicted effects of these two 
approaches to work design. 


Effects of Implementation of Autonomous Workgroups 


Sociotechnical theorists have explicitly, or more often implicitly, 
described rationales for and effects of autonomous workgroups in terms of 
individual psychological reactions (Davis, 1966; Kelly, 1978). The underly- 
ing assumptions have been that this way of organizing work is intrinsically 
motivating and enhances employee satisfaction; from these reactions follow 
improved group performance and reduced labor turnover. Here again there 
is a direct correspondence to the individual job design approach; in their job 
characteristics model, Hackman and Oldham (1976) posited these same causal 
relationships. It is thus not surprising that Hackman (1983) developed a 
normative theory of workteam effectiveness from the literatures on job design 
and small group behavior. Though it deals with wider issues concerning the 
composition of workgroups, at the heart of this integrative model are the 
same propositions concerning the implications of autonomous group work 
for motivation, member satisfaction, performance, and labor turnover. 

The literature suggests two further effects of autonomous workgroups, 
increased organizational commitment (Emery, 1959) and improved mental 
health (Herbst, 1974). Indeed, Herbst reported that the “starting point of the 
early coal studies was provided by psychiatric investigations . . . which gave 
evidence of an epidemic incidence of psychosomatic disorders among miners” 
(1974: 3). The implication is that a change to autonomous group work would 
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reduce such disorders. Other research (Alcalay & Pasick, 1983; Broadbent, 
1985; Fraser, 1947; Gardell, 1971; Herzberg, 1966; Karasek, 1978; Kornhauser, 
1965; Wall, Clegg, & Jackson, 1978; Wall & Clegg, 1981) undertaken in many 
different work settings gives general support to this view. Neither organiza- 
tional commitment nor mental health is a central element in existing theoreti- 
cal propositions; nevertheless, these outcome variables were included in 
the present investigation to continue that line of research. 


Need for Field Experimentation 


Although the nature and predicted effects of autonomous workgroups 
are relatively clear from the theoretical literature, research on their effective- 
ness has been less definitive. Lawler concluded that "available research 
consists mainly of poorly documented case studies and correlational studies" 
(1977: 576) and called for research based on quasi-experimental designs. 
Cummings, Molloy, and Glen reviewed 58 studies with work experiments 
and found “the studies fared badly when assessed against the [quasi- : 
experimental] criteria used in this critique” (1977: 702). Pasmore and col- 
leagues argued that “more closely controlled experimental introductions of 
work restructuring are called for, with appropriate groups serving as controls 
during the experiments" (1983: 1198); Roberts and Glick (1981) endorsed 
this view in relation to job design research in general. 

With respect to studies of autonomous workgroups, Lawler’s (1977) alle- 
gation of low quality was perhaps harsh. There exist several careful accounts 
of such innovations (Davis & Valier, 1965; Kemp, Clegg, & Wall, 1980; Kemp, 
Wall, Clegg, & Cordery, 1983; Rice, 1958; Trist & Bamforth, 1951; Trist, 
Susman, & Brown, 1977). Nevertheless, it is a valid point that very few of 
the findings are based on research designs allowing causal inferences. A 
point less often raised, but nonetheless important, concerns the time spans of 
research into work design. Most studies, particularly those with relatively 
rigorous designs, limit their evaluation to but a few months. But work designs 
try to achieve enduring changes in employees’ attitudes and behaviors, so a 
perspective in terms of years rather than months is desirable. Previous 
research can be largely interpreted as supporting only short-term and thus 
possibly transitory effects (Wall & Clegg, 1981). 

The absence of long-term experimental studies of autonomous work- 
groups is not altogether surprising. Research of this nature is time consum- 
ing and demands considerable resources. Moreover, the opportunities are 
few. Because this research is of immense theoretical and practical importance, 
such opportunities should be exploited when they arise. Such an opportu- 
nity presented itself to us, as the remainder of this paper describes. 


Aims of the Present Study 


The primary aim of this long-term experiment was to examine the effects 
of autonomous workgroups on employees’ attitudes and behaviors. More 
specifically, this experiment tested the proposition that this form of work 
design enhances work motivation, job satisfaction, and group performance, 
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and reduces labor turnover. We also included organizational commitment 
and mental health as outcome variables for exploratory purposes. The study’s 
secondary aim was to report and comment on the organizational issues 
arising from extended use of autonomous workgroups. 


METHODS 
Background, Setting, and the New Form of Work Organization 


This study was set within a large, nonunionized, British company that 
produces confectionery at several sites for the home market and export. 
When planning a new factory in the southeast of England, senior manage- 
ment decided to take the opportunity to try a new method of working that 
recognized the needs and capitalized upon the skills of their employees. 
This choice reflected an ideological commitment to increasing responsibility 
on the shopfloor and an understanding of modern principles of social sci- 
ence governing work design. The company engaged a consultant from the 
Tavistock Institute of Human Relations, Dr. Lisl Klein, to work alongside 
managers, engineers, and architects throughout the planning process. The 
outcome of this collaboration was a modern, spacious, and comfortable fac- 
tory whose management structures and practices were designed to support a 
well-developed form of autonomous group working. 

From the outset of the study we were independent from the consultant 
and were not involved in the design and implementation processes. Our role 
was to evaluate the effects of autonomous workgroups over a period of three 
years, beginning with the start-up of production. We accepted no fees from 
the organization and guaranteed both confidentiality of individual views 
and company anonymity. 

The intent of the new form of work organization was to provide shopfloor 
employees with substantial autonomy in carrying out day-to-day production 
along with a high level of involvement in operational decision making. In 
practice, production employees worked in groups of 8 to 12 people, all of 
whom were expected to carry out each of eight types of jobs involved in the 
production process. Group members were collectively responsible for allocat- 
ing jobs among themselves, reaching production targets and meeting quality 
and hygiene standards, solving local production problems, recording produc- 
tion data for information systems, organizing breaks, ordering and collecting 
raw materials and delivering finished goods to stores, calling for engineering 
support, and training new recruits. They also participated in selecting new 
employees. Within each group, individuals had considerable control over 
the amount of variety they experienced by rotating their tasks, and each 
production group was responsible for one product line. Group members 
interacted informally throughout the working day but made the most impor- 
tant decisions—for example, regarding job allocation—at formal weekly group 
meetings where performance was also discussed. This method of working 
thus meets all the major criteria of autonomous group work. 
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However, perhaps the most marked feature of this form of work organi- 
zation concerned supervision. Since nearly all the functions typically car- 
ried out by supervisors were the responsibility of shopfloor employees, the 
position of supervisor did not exist. Groups reported directly to a support 
manager who had overall responsiblity for some 40 to 70 employees and 
whose role was to provide the necessary support and guidance for the develop- 
ment of both social and technical skills within the workgroups. Support 
managers were responsible for providing the facilities and creating the condi- 
tions within which groups could operate in a semiautonomous fashion, while 
maintaining disciplinary procedures and running an effective production 
facility. 

With one exception, the managers employed in the new factory were 
appointed from elsewhere in the company. Some were highly committed to 
the new working method, some open-minded, and others somewhat skeptical. 
None had previous experience of group working nor of developing a green- 
field site.* They began work on site prior to the factory's commission. Several 
were involved in planning the factory over a two-year period. With help 
from the factory personnel manager, managers were responsible for recruit- 
ing employees for their own departments. Inevitably they attempted to select 
and recruit people who would be able to work in groups, but they had no 
specific operational criteria to guide this process. Immediately prior to the 
start-up of production, all employees took a one-week induction course run 
by the factory management; the course focused on how the factory would be 
organized and managed with particular reference to the theory and practice 
of autonomous group work. Such induction courses continue to be run on a 
smaller scale than the original effort for all new employees. Over time, man- 
agement transferred key personnel, those most skilled and experienced in 
the production process, within the factory to help develop new workgroups. 
In short, the workgroups have not been stable over time. In the factory as a 
whole, several features reflected the philosophy underlying autonomous 
group working: a single-status canteen, a single-status car park, no timeclocks, 
monthly payment by credit transfer, and a flat (three-level) management 
structure. 


Research Design 


The overall design of this study was quasi-experimental. It consisted of 
four conditions and up to three measurement occasions, as Table 1 shows. 
Condition A, the main experimental group, included all day-shift employ- 
ees in the new factory who from the outset of production worked under the 
system of autonomous workgroups described in the preceding section. The 
company initially recruited 42 employees; the number rose to 84 during the 


1 The term “greenfield” is used here, and subsequently, to refer to a site where a new factory 
is built and commissioned by the parent company. It is thus specifically designed for current 
purposes, unlike existing premises, which were previously used for other purposes, and then re- 
furbished for new ones. 
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course of the study. The numbers of employees reported in Table 1 corre- 
spond to the actual numbers of employees participating in the study. 

Within the overall research design, condition B served initially as a non- 
equivalent control group and later as a second experimental group. It 
included all employees on the evening shift at the new factory who made 
the same products using the same machinery as their day-shift (condition 
A) counterparts. For the first 12 months of the investigation their work was 
conventionally organized. Employees were allocated to particular tasks 
and a supervisor who reported to a shift manager had control over all 
aspects of production. Employees had none of the discretionary areas of 
responsibility afforded to those in condition A. After 12 months of produc- 
tion at the new factory, however, the company decided to extend autono- 
mous group work to this evening shift, largely at the request of the employ- 
ees themselves. At this point condition B became a second experimental 
group. This group initially had 25 individuals and grew to 60 by the end of 
the study. At the point of change to autonomous group work, the supervisor 
was promoted to support manager and the original manager transferred 
elsewhere in the company. 

There were two reasons for the conventional organization of the eve- 
ning shift at the outset of production. The first was to employ a supervisor 
and manager from a sister factory that was closing. The second was to 
avoid the risk of introducing autonomous group work with inexperienced 
employees whose schedule left them remote from the normal managerial 
support structure. DE 

Conditions C and D were two further nonequivalent control groups, 
representing day and evening shifts in a factory elsewhere in the organiza- 
tion where jobs had conventional designs. Supervisors allocated particular 
tasks to employees and took responsibility for operational decision making. 
The products and technology were very similar to those in the greenfield 
site, the same overall company policy was in force, and pay levels were 
equivalent. Condition C initially included 290 employees; during the course 
of the study the number available dropped to 227. Condition D had 64 
employees. 

All employees involved in the study received payment based on stan- 
dard rates reflecting an overall company policy of remunerating its employ- 
ees at a little above the median rate for manufacturing companies in the, 
area. There was no payment-by-results or group or company bonus scheme. 
However, overtime was available at an increased hourly rate of pay. 

As originally designed, the study involved three waves of measurement 
for conditions A and B, with measures after 6, 18, and 30 months of 
production, and two waves of measurement for conditions C and D at times 
corresponding to the first two collections of data for conditions A and B. 
The primary focus for the company was a full-scale evaluation of the green- 
field site, so the three measurements there were a matter of course. We 
negotiated the two measurements at the conventional site to enable 
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comparisons. In practice it was impossible to complete the final data collec- 
tion for condition D as tbe evening shift at the established factory was 
discontinued. The response rate was high at both sites throughout the study, 
varying from 70 to 95 percent of the total workforce. - ) 

As for predictions based on the overall research design, the expected 
pattern of results was clear. A simple effect for work design would show a 
pattern corresponding directly with the existence of autonomous workgroups 
and unaffected by other factors such as site, shift, or time. Let us assume for 
the sake of argument that autonomous workgroups would have a positive 
effect on job satisfaction. Then employees in conditions A1, A2, A3, B2, and 
B3 should record higher levels of job satisfaction than their counterparts in 
conditions B1, C1, C2, and D1. Within this global comparison, however, 
more precise predictions are embedded. For the first time period, for 
example, the predicted pattern would be of the form A1 > B1, C1, D1. Lack of 
difference between conditions C1 and D1 and lack of difference between 
conditions B1 and D1 would help to discount shift and site effects, re- 
spectively. This would strengthen the interpretation that if condition A1 
differs, the effect is attributable to autonomous group work. Taking the 
prediction over time, it is clear that condition B should change between the 
first and second measurement occasions (B1 < B2, B3), whereas conditions - 
A and C should remain constant, but at different levels, since condition A 
scores will be augmented from the outset by autonomous group work. The 
final column and row of Table 1 shows the overall pattern of results pre- 
dicted if work design rather than site, shift, or time has significant effects. 

This design avoided the threats to internal validity presented by history, 
maturation, testing, instrumentation, and statistical regression (Campbell & 
Stanley, 1966; Cook & Campbell, 1979a). To some extent it reduced the 
threats of “compensatory rivalry" and “resentful demoralization" (Cook & 
Campbell, 1979b: 81—82) because employees in conditions C and D were so 
distant from A and B— some 250 miles—as to allow little awareness of differ- 
ences in work designs. Selection and its interaction with other threats and 
mortality posed the major potential challenges to internal validity. We dis- 
cuss these later under Supplementary Analyses. 

Thus the overall research design had strong internal validity and poten- 
tially high external validity. It should be noted that we could not deploy the 
full design for the behavioral variables. During the first six months at the 
greenfield site, the reliability of the newly commissioned machinery was 
substantially inferior to that at the established site. This had both direct and 
indirect effects on operators' rates of productivity. To separate human perfor- 
mance from machine-caused production losses proved impossible, and per- 
formance indices at this stage were consequently unreliable. Similarly, given 
the small numbers in the groups at the greenfield site during the first six 
months, the incidence of labor turnover was too small to justify separate 
analysis. As the period of assessment lengthened and the numbers of employ- 
ges increased, a stable pattern emerged. Thus we assessed the relationships 
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of autonomous group work to these behavioral measures by gross compari- 
‘sons between the two sets of conditions; the Results section considers these 
relationships separately. 


Data Collection and Analyses 


Data on employees’ perceptions and attitudes were obtained by question- 
naires administered after each of the specified time periods. Confidentiality 
was assured, and completion during work time was voluntary. We obtained 
figures on labor turnover and performance from factory records. These indi- 
ces are described in greater detail under Results. Throughout the study we 
maintained regular contact with the sites and made periodic visits to observe 
work in progress and interview both shopfloor employees and managers. 

Self-report measures. For use as a manipulation check we developed a 
scale to measure perceived autonomy of workgroups. This measure included 
13 items based on the criteria offered by Gulowsen (1972) for designing 
autonomous workgroups. Thus, we asked respondents the extent to which 
their workgroups got involved in specific activities of which these are 
examples: organizing time for breaks, allocating jobs amongst yourselves, 
setting the pace of work, and solving local problems. Responses were mea- 
sured on 5-point formulas. Scale reliabilities (a) at each of the three measure- 
ment occasions were a = 0.82, 0.79, 0.84. 

Work attitudes were measured on scales developed and standardized in 
previous research on blue-collar employees (Clegg & Wall, 1981; Cook & 
Wall, 1980; Cook, Hepworth, Wall, & Warr, 1981; Warr, Cook, & Wall, 1979). 
These were: intrinsic job satisfaction (2 = 0.82, 0.85, 0.79);? extrinsic job 
satisfaction (a = 0.70, 0.74, 0.75); intrinsic job motivation (a = 0.87, 0.90, 
0.83); and organizational commitment (a = 0.77, 0.70, 0.74). Appendix A 
gives numbers of items in each scale and examples of items. 

The General Health Questionnaire (Goldberg, 1972) concluded the 
questionnaire. This self-administered screening test designed to detect 
diagnosable, nonpsychotic, psychiatric disorders measures important aspects 
of mental health. Previous work has established the reliability and validity 
of this measure for general populations (Goldberg, 1978). The present study 
-used the 12-item version, which has been shown to be both reliable and 
valid for employed samples (Banks, Clegg, Jackson, Kemp, Stafford, & Wall, 
1980; Banks & Jackson, 1982; Jackson, Stafford, Banks, & Warr, 1983; Wall & 
Clegg, 1981). This questionnaire asks people if they have recently lost much 
sleep over worry, felt constantly under strain, been able to enjoy their nor- 
mal day-to-day activities, been losing confidence in themselves, and so forth. 
Items have 4-point response formats ranging from “not at all” to “much more 
than usual." We computed overall scale scores for the three measurement 
occasions (a = .84, .83, .94). 

Workforce characteristics. The study is based on data from a total of 545 
employees, 83 percent of whom were women. Their average age was 31 


? Reliabilities here and subsequently are given for each of the three measurement occasions. 
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years; average age differed between shifts, evening-shift workers being older, 
but not between sites. Length of service naturally differed between the sites; 
it averaged 1.5 years at the end of the investigation for those in the new 
factory and had a mean of 5.7 years for those in the established factory. 
Statistical analyses. Statistical analysis proceeded in two main stages. 
First, we applied generalized least-squares analysis to the total data set in 
order to provide an omnibus test of the study's predictions: Second, we 
examined more specific predicted patterns of results within the total data set 
using (1) one-way analysis of variance across conditions and (2) one-way 
analysis of variance over time for both independent groups and repeated 
measures. Appendix B describes our rationale for his dual approach. 


RESULTS 


The findings are presented in three parts: (1) results from the primary 
analyses on the perceptual and affective outcome data, (2) supplementary 
analyses examining the viability of rival explanations for the above results, 
and (3) findings related to the performance, labor turnover, and wider 
organizational consequences of autonomous workgroups. 


Perceived Work Characteristics, Work Attitudes, and Mental Health 


The initial analyses defined a regression equation for each attitudinal 
measure as the dependent variable, with shift (day versus evening), site 
(conventional versus greenfield), time (three measurement occasions), and 
work design (presence versus absence of autonomous workgroups) as 
predictors. We derived parameter estimates by generalized least squares (see 
Appendix B). 

Throughout the analyses each dependent variable was examined 
separately. Such an approach may lead to misleading inferences when sev- 
eral dependent variables are highly related. In the present study, however, 
this was not the case, as is demonstrated in Table 2. This shows the correla- 
tions between the major variables in the study for the first time period only 
since the results were stable over time. Furthermore, dependent variables 
showed a diversity of response. These factors support the use of parallel 
analysis of each variable ignoring other dependent variables. 





TABLE 2 

Intercorrelations among Dependent Variables at Time 1? 
Dependent Variables 1 2 3 4 5 
1. Intrinsic job satisfaction = 
2. Extrinsic job satisfaction 77 - 
3. Intrinsic job motivation 30 42 — 
4. Organizational commitment 40 52 64 = 
5. Mental health? —.27 —.30 —.12 —.30 — 


*Sample size varied betweeen 271 and 344. 
bLower scores signify better mental health. 
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With respect to perceived autonomy of workgroups, employees under 
autonomous work conditions gave significantly higher responses than their 
counterparts experiencing conventional work designs (B = 0.68, p< .001}. This 
finding served as a manipulation check, confirming that employees per- 
ceived actual differences in work designs. Moreover, the size of this effect 
illustrates substantial differences between the two sets of work conditions, 
which thus allows a realistic test of the attitudinal effects of this form of 
work design. 

Work design also had effects on intrinsic job satisfaction (8 = 0.34, change 
in R2 of 2%, p < .001); extrinsic job satisfaction (B = 0.30, change in R2 of 1%, 
p<.01); and organizational commitment (B = 0.25, change in R2 of 1%, p < .01); 
but there was no effect for intrinsic job motivation (B — 0.16, n.s.) or mental 
health (B — -0.04, n.s.). Thus, this overall analysis supported three of the five 
predictions concerning individual attitudinal effects. 

These analyses provided an overall test of the effects of autonomous 
workgroups, but they do not suffice to test the specific patterns of findings 
that the research design allows and causal interpretation requires. Initially, 
these patterns concern differences between groups at each measurement 
time (Table 1, final column). Table 3 shows relevant results. These results 
show that for perceived autonomy of workgroups we obtained the specific 
predicted pattern of results. On the first measurement occasion, employees 
in condition A1 recorded substantially higher scores than the three control 
groups, which formed a homogeneous set. Similarly, on the second measure- 
ment occasion, when employees in condition B2 had changed to autono- 
mous group work, their scores matched those for condition A2 and differed 
from scores for condition C2. The only minor anomaly is at the third 
measurement, where a difference in mean scores exists between conditions 
A3 and B3, though both scores are high. These findings confirm those of the 
initial regression analysis and moreover demonstrate the full expected pat- 
tern of results. Thus, the manifest changes in work design clearly took effect 
and can be seen to persist for the duration of the study. _ 

To the extent that work design determines work attitudes and mental 
health, the patterns of scores recorded above should be duplicated for the 
affective measures. Such is the case for intrinsic job satisfaction. The differ- 
ences across conditions on the first and second measurement occasions 
showed the predicted patterns, and on the final measurement occasion the 
conditions did not differ. For extrinsic job satisfaction we obtained an equiv- 
alent set of findings. 

However, for the remaining three dependent variables differences across 
conditions do not correspond to the specific patterns required to indicate an 
effect for work design. Results for intrinsic job motivation and mental health 
confirm those of the initial regression analysis. But although the first analy- 
sis suggested that work design had an effect on organizational commitment, 
the second led us to reject such an interpretation. There were clear instances 
in which the levels of organizational commitment under traditional forms of 
work design matched those recorded for autonomous workgroups. Thus we 
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must reject the hypothesis of a work design effect on organizational commit- 
ment in this instance. 

The final set of primary analyses concerned predicted changes over time 
within groups (Table 1 final row]. Table 4 presents relevant findings. We ob- 
tained the predicted results for perceived autonomy of workgroups. Under con- 
dition B the mean score changed between the first and second measurement 
occasion and remained at the raised level at the final measurement. There 
was no corresponding change in either of the control conditions A and C. 

The results for intrinsic job satisfaction show the equivalent pattern. In 
condition B, the recorded change was between the first and second measure- 
ment occasions, with the final mean remaining level. Again, no such change 
emerged for conditions A or C. 

With respect to extrinsic job satisfaction, results are different. In condi- 
tion B there was an increase in mean scores following the introduction of 
autonomous workgroups (time 1 to time 2) that diminished by the final 


TABLE 
Results of Analyses of Variance for Repeated 





Condition As 


Dependent Type of Tukey 
Variables Analysis Time 17  Time2 Time 3 F df Test 
. Perceived work Repeated 45.9 49.4 49.7 1.0 2,20 = 
group autonomy Independent 44.0 45.6 47.0 «1.0 2,124 -— 
Intrinsic job Repeated 37.6 36.4 34.9 12 2,36 = 
satisfaction Independent 37.8 36.2 36.3 «1.0 2,145 SS 
Extrinsic job Repeated 47.3 43.8 41.4 6.5** 2,36 = 
satisfaction Independent 46.8 44.0 43.5 3.3* 2,141 A1, A2 > 
A2, A3 
Intrinsic job Repeated 36.1 36.5 33.4 5.6** 2,36 ~ 
motivation Independent 35.2 34.0 34.3 «1.0 2,145 - 
Organizational . Repeated 52.4 52.1 45.0 6.3** 2,30 B 
commitment Independent 50.7 48.1 46.3 3.0 2,138 A1, A2 > 
A2, A3 
Mental health Repeated 18.6 20.8 23.7 5.8* 2,34 ix 
Independent 18.8 19.6 21.0 3.2* 2,140 .A2, A3 > 
A1, A2 





Autonomous group work at all three measurement occasions. 
bTraditional job design at time 1; autonomous group work at times 2 and 3. 
"Traditional job design at both measurement occasions. 
dMeans are reported. Time 1 was after six months; time 2 after 18 months; and time 3 after 
*Lower scores signify better mental health. 
*p«.05 
"*p«.01 
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measurement occasion. This increase reaches statistical significance for the 
analysis of variance for repeated measures but just fails to do so in the 
analysis of variance for independent groups (F = 3.0, p < .06). It is as if imple- 
menting this form of work design enhances extrinsic job satisfaction over the 
short rather than the long term. This interpretation is consistent with the 
findings for condition A, in which the initially high level of extrinsic satisfac- 
tion declined over time. 

For the remaining three dependent variables, the third analysis again 
provided no evidence of an effect for autonomous workgroups. Results for 
condition B showed none of the predicted changes, and scores for the other 
conditions remained generally stable. This confirms both the first and sec- 
ond analyses. It is interesting to note that, in condition A, scores for mental 
health deteriorated over time. However, the effect is small, and the actual 
scores recorded at all three periods compare favorably with those of general 
populations (cf. Goldberg, 1978). 








4 
Measures and for Independent Groups over Time 
Condition B^ Condition Cc 
Tukey 
Time1 Time2  Time3 . F df Test Time1 Time 2 F df 
23.3 45.0 42.3 16.6** 2,16 - 31.4 308 «10 1,83 
26.2 42.4 38.7 23.8** 2,84 B2,B3 31.6 318 <1.0 1,283 
; > B1 
29.7 35.2 35.0 4.2** 2,8 - 30.0 28.9 2.3 1,113 
31.6 34.6 35.4 4.1** 2,103  B2,B3 29.5 29.7 «10 1,944 
> Bi ; 
39.4 43.8 40.0 3.6* 2,16 - 38.0 375 «10 1114 
411 44.7 42.8 3.0 2,103 - 37.5 381 «10 1341 
37.1 37.4 376 | «10 2,16 = 31.5 32.3 «1.0 1,108 
36.4 35.9 859 <1.0 2,107 - 313 32.3 1.8 1,348 
48.3 52.0 51.2 1.6 2,18 - 41.6 41.) «10 1,117 
48.6 51.1 48.3 2.2 2,108 - 41.8 41.4 «10 1,354 
21.1 19.8 23.9 1.5 2,14 - 21.7 214 «10 1,121 
22.2 20.9 21.7 «10 2,103 - 22.1 '214 2.4 1,353 





30 months. 
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Interpreted with a strict line in causal inference, the results of this study 
provide evidence that the effects of autonomous workgroups are strong, but 
restricted to perceptions of workgroup autonomy and job satisfaction. Intrin- 
sic job satisfaction shows an effect that persists over time, but it appears that 
the effect on extrinsic job satisfaction is less enduring. Autonomous 
workgroups have no clear effect on internal work motivation, organizational 
commitment, or mental health. 


Supplementary Analyses 


In a study such as this, random allocation across conditions cannot be 
achieved and labor turnover and other factors lead to missing values over 
time. Inevitably, rival explanations involving selection and mortality threaten 
causal interpretations. We consider selection issues first. Minor differences 
across groups existed from the outset in production processes and the ratio 
of men to women; in addition, there were differences in length of service. In 
order to examine the relevance of these threats, we repeated the three analy- 
ses described in the preceding sections with subsamples selected to be equiva- 
lent in these respects. Specifically, we conducted analyses in which (1) we 
restricted respondents from conditions C and D to those working on produc- 
tion processes that were virtually identical to processes performed by employ- 
ees in conditions A and B; (2) we omitted the small number of men 
respondents; and (3) we omitted employees in conditions C and D whose 
length of service exceeded that of those in A and B. As in the three primary 
analyses, findings showed unambiguous effects for work design on perceived 
workgroup autonomy, intrinsic job satisfaction, and extrinsic job satisfaction, 
but no effects for intrinsic job motivation, organizational commitment, or 
mental health. Thus, differences between conditions in type of work, gender 
composition, and length of service cannot account for the findings. 

Mortality is clearly a threat with respect to analysis of changes over time. 
The analysis of variance for repeated measures is necessarily based on a 
small proportion of the total population of respondents, and may therefore 
involve employees who do not represent the whole. Correspondingly, results 
from the analysis of variance for independent groups could provide spurious 
results because individuals lost or gained responded differently from those 
who remained throughout the study. To examine these threats we used 
t-tests to compare responses from newcomers and leavers in each experimen- 
tal and control condition for each dependent variable with parallel responses 
for the same measurement occasions from employees also present at other 
time periods. None of these analyses yielded statistically significant differ- 
ences. Thus we discounted threats based on mortality. 


Performance, Labor Turnover, and Organizational Effects 


From the outset an overall production target was set at the greenfield 
site. Based on criteria derived from work studies, this target specified the 
weight of output required according to the number of hours operators worked, 
called direct hours. In addition, during the first year the company set targets 
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for individual and group output efficiency. Throughout this early period the 
amount produced by employees in conditions A and B fell far short of 
targets, individual and group efficiencies were low, and machine utilization 
was poor. With increased reliability of the technology, developing skill lev- 
els amongst the operators, and increased personnel levels, production stead- 
ily increased. Output rose from 50 tonnes to over 400 tonnes® per working 
month; overall efficiencies per working month exceeded 80 percent; quality 
improved and scrap decreased. Towards the end of the second year, employ- 
ees in the two autonomous workgroup conditions reached output targets, 
and their outputs remained at that level for the remainder of the study and 
beyond. After three years the greenfield factory passed an assessment of its 
performance by the parent company. 

However, direct comparison with the conventionally organized sister 
factory proved impossible. Throughout the study, output and efficiencies at 
this site remained stable and high. Data on production and performance 
were complex, and the criteria for the inclusion of different types of produc- 
tion information varied between sites. Some examples: standards for time 
required to turn out given products varied; each site categorized different 
employees as direct or indirect labor; and the established site only had 
information available at departmental or factory levels, not for individuals. 

Nevertheless, examination of company records and discussions with 
relevant personnel did provide some insights into the meanings of the pro- 
duction data. First, these people felt that overall efficiency at the greenfield 
site was approximately 5 to 10 percent lower than that at the comparison 
site. However, they did not see lack of effort from individual operators as the 
major cause; rather, they saw employee performance at the two sites as 
broadly comparable. Moreover, at the greenfield factory, the target system 
largely determined output levels and employees’ efforts, and there was no 
requirement that operators should exceed the targets. Second, in attempting 
to maintain efficiency levels, the factories responded differently to market 
demands. Managers at the conventional site recruited or released short-term 
labor when demand fluctuated. The greenfield site maintained stable person- 
nel levels as far as possible in order to protect and foster the spirit and 
practice of team working. By the end of the study the autonomous workgroups 
could easily meet increased market demands by raising ouput and efficiency. 
Thus, the two sites implemented different organizational strategies to main- 
tain the efficient production of sweets. Neither site could be proved to be 
superior in performance. This was contrary to predictions for individuals | 
and groups at the greenfield site. 

One implication of this discussion of productivity is that productivity 
benefits could accrue at the greenfield site once targets had been met. This is 
because there were no indirect costs for supervisory staff. The established 
site had on the average one supervisor for every ten shopfloor operators, one 


3 A tonne = 1000 kg. (approximately 2240 lbs.) in the metric system of weights. 
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of whom acted as chargehand in the supervisor’s absence. Productivity gains 
at the greenfield site could of course be lost during periods of low demand 
when direct labor was underused. 

Contrary to predictions, examination of the data on labor turnover showed 
that the overall incidence for the autonomous work conditions (28% annually) 
exceeded that for the comparison conditions (22%). Analysis of the propor- 
tion of leavers to stayers over two-and-a-half years of the investigation yielded 
a statistically significant difference (x2 = 52.7, df=1, p < .001). Breaking this 
overall figure down according to reasons for leaving produced an interesting 
picture. With respect to resignations for personal/domestic and health reasons, 
there was no difference between autonomous and traditional work design 
‘conditions (x2 = 1.75, n. s.; x? < 1, n. s.). By comparison, the proportion of 
those who left for other jobs was higher in the autonomous work conditions 
(x2 = 19.72, df = 1, p < .001). Itseems unlikely this indicates general dissatisfac- 
tion as analyses showed no difference between leavers and stayers on job 
satisfaction or other attitudinal measures. It is more likely that this finding 
reflects differences in local labor markets because the autonomous workgroups 
were in a region having lower levels of unemployment than the conventional 
site's region. In addition, some individuals may not fit into a system of group 
working. But the most marked difference between the conditions with autono- 
mous and conventional job designs concerned dismissals for disciplinary 
reasons. Labor turnover in this category was at a rate of 8.2 percent in the 
autonomous workgroups, whereas in the comparison groups it was only 1.3 
percent. The difference between the two was highly significant statistically 
(x2 = 135.25, df = 1, p < .001), though the absolute number of dismissals was 
small, 

One explanation for this finding is that rate of dismissals will go up 
following any major recruitment program since a small proportion of employ- 
ees will not fit easily into their new organization. However, there is some 
evidence that the high number of disciplinary dismissals was also a direct 
consequence of autonomous group working. Interviews with managers and 
employees support this interpretation and illustrate some of the organiza- 
tional implications of sustaining autonomous workgroups. Line managers 
were aware that they had dismissed employees who, under conventional 
circumstances, they would not have required to leave. They attributed their 
taking a harder line in part to the pressures on them to produce and to 
develop the new site, but they saw a side effect of autonomous group work- 
ing as the main reason. The absence of supervisors meant that the few ` 
employees who stepped out of line or abused their freedom were not immedi- 
ately detected. Under conventional job designs, supervisors would take early 

-action and perhaps protect employees from managers. In autonomous 
workgroups, misdemeanors were typically more advanced when they came 
to light and warranted use of formal warning procedures. In other words, 
management at the greenfield site relied more on formal procedures and 
sanctions than did the conventional site’s managers. Interestingly, the 
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workgroups were loath to discipline members who consistently broke group 
norms and explicitly referred these problems to the support managers. 

Some findings from the interview program conclude this section. At the 
start-up of production at the greenfield site, managers reported high levels of 
stress, and interpersonal and interfunctional conflict. These reactions were 
due to the difficulties involved in developing a greenfield site and managing 
an untried system of group working. Indeed, some managers believed that 
the experiment with group working made it a harder and slower job to get 
the new site working effectively. Over time, with the solving of technical 
problems and the emerging success of the work groups, the managers’ stress 
levels fell, their attitudes toward their jobs improved,* and the management 
team became more cohesive. However, they still perceived their roles as 
highly stressful. As one manager stated, “The mental, physical, and emo- 
tional pressure on the factory management with this type of organization 
should not be underestimated and, dare I suggest, is significantly greater 
than for our equivalents in other factories." Indeed, the support managers 
felt the greatest level of pressure both in terms of the production output 
required and in terms of the difficulties involved in communicating, 
intervening, disciplining, and providing the resources and conditions neces- 
sary for up to 70 employees in six teams to work together effectively and 
efficiently. As a result the factory employed four different day-shift support 
managers in the first three years; this has now stabilized, The difficulty 
was exacerbated because it was not possible to promote personnel within the 
factory via supervisory positions. Hence, new job incumbents needed time 
to be socialized into the organization but immediately experienced the 
problems of managing semiautonomous workgroups. 

But in pointing out these problems, we should not lose sight of the 
benefits of autonomous workgroups. In this study, these benefits were suffi- 
cient to ensure that operators and managers alike supported the system, 
which is to be continued for the foreseeable future. During interviews, for 
example, we saw clearly that shopfloor employees actively enjoyed the sys- 
tem and its attendant responsibilities. None preferred conventional forms of 
work design of which all but a few had prior experience. All were pleased 
not to have supervisors breathing down their necks and spoke very highly of 
their work conditions. Employees at the new factory particularly enjoyed not 
having to clock on and off and also appreciated the various other ways in 
which management treated them as mature adults. As one employee stated, 
"After 20 years of working at two other firms, the conditions here are . 
excellent—managers treat you as people." Indeed, another suggested that 
"work groups enable us to get things off our chests and do something about 
jt." One consequence of this satisfaction was a good industrial relations 
climate in the factory. 


* Managers completed the same work attitude measures and General Health Questionnaire 
administered to the shopfloor employees (described in the Method section) both at the start-up 
of production and 30 months later. 
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DISCUSSION AND CONCLUSIONS 


This field experiment showed that autonomous workgroups had specific 
rather than wide-ranging effects on employee attitudes and behavior. Although 
implementation of this work design substantially enhanced intrinsic job 
satisfaction, it did not demonstrably affect job motivation, organizational 
commitment, mental health, work performance, and voluntary labor turnover. 
At an organizational level, implementation made productivity benefits possi- 
ble because the autonomous workgroups reduced indirect labor costs; a side 
effect was increased dismissals. Employees clearly appreciated the autono- 
mous work system. On balance, managers did too, though clearly there were 
costs in terms of personal stress arising from the difficulties involved in 
managing and maintaining the system. 

Overall, the most striking aspect of these findings is the restricted range 
of effects on individual attitudes compared with the range predicted from 
the literature. One possible explanation for this is that the form of work 
design introduced did not differ sufficiently from conventional practice. 
However, this is implausible. Objectively, the system of autonomous work- 
groups was a radical departure for this organization, and the results pertain- 
ing to employees' perceptions of autonomy of workgroups testify that those 
involved experienced it as such. As in any investigation, a second possible 
explanation is that the findings are idiosyncratic. This possiblity can never be 
logically dismissed but seems implausible. Work design theory applies gen- 
erally to shopfloor jobs, and there is no reason to dismiss the present setting as 
atypical in this regard. The most likely explanation for the restricted range of 
effects on employee attitudes is that the rigorous research design eliminated 
causal interpretations that weaker designs would have supported. Even within 
the present study it is easy to see how this could have occurred with a 
different research design. If for example we had restricted the design to 
conditions A and C, we would have inferred effects on job motivation, organi- 
zational commitment, and mental health, as Table 4 makes evident. Much of 
the previous research from which we took predictions is of such a compara- 
tive nature. The general point is that more rigorously designed studies serve 
to eliminate a greater number of hypotheses. 

An interesting aspect of the present study is that it analyzed two differ- 
ent approaches to implementation of autonomous group working. Employ- 
ees in condition A were hired for jobs under the new form of work design, 
whereas employees in condition B had their jobs redesigned. As Roberts and 
Glick commented, “the act of having one's job changed may have different 
influences on a person than does his or her signing on to impoverished or 
complex tasks" (1981: 210). However, the results of the present study show 
quite clearly that both approaches had the same effect. Both improved job 
satisfaction. This suggests that it is the nature of autonomous workgroups 
that is the main causal factor in determining employees' attitudes. 

The finding that the satisfaction effect is stronger and more enduring for 
intrinsic rather than extrinsic aspects is interesting. This is consistent with 
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theory that emphasizes the importance of job content to people’s experience 
of work itself. However, the finding also suggests that satisfaction ceases to 
overflow to wider contextual circumstances over time. This reflects earlier 
theory (Herzberg, 1966) ascribing secondary importance to conditions of 
employment. , 

The most problematic outcome of this study is the absence of effects for 
intrinsic job motivation. Implicitly or explicitly, this concept is at the core of 
most approaches to job design (Davis, 1966; Hackman & Oldham, 1976).5 In 
the present case it might be argued that failure to find such an effect resulted 
from the use of an insensitive instrument. The intrinsic job motivation mea- 
sure yielded responses distributed towards the top of the scale, indicating 
that a ceiling effect was to some extent operative. Although this distribution 
supports the insensitivity argument, it is not compelling, since the scale was 
sensitive enough to reveal statistically significant differences across condi- 
tions and over time that did not correspond to effects of work design. On the 
basis of this study, it would seem that no demonstrable motivational benefit 
derives from autonomous group working. To the extent that autonomous 
workgroups reflect some general principles of job design this finding is of 
considerable significance in suggesting that major theoretical tenets may be 
invalid. 

The absence of an unambiguous effect on organizational commitment is 
less troublesome given recent theoretical developments. Though earlier work 
(Emery, 1959) led us to expect such an effect, which was also an outcome the 
company desired, recent theory (Hackman, 1983) suggested that autono- 
mous workgroups may have their effect on commitment to workgroups rather 
than to organizations. Our investigation did not address this possibility, 
which deserves further empirical attention. Similarly, job design had no 
effect on the other exploratory variable, mental health. It may be, however, 
that this variable is still important in the context of impoverished jobs 
(Broadbent, 1985). 

Current theory also offers several reasons for the lack of a performance 
effect. The factory we studied did not implement all the design criteria for 
autonomous workgroups that current theory has made explicit (e. g. Hackman, 
1983). In particular, rewards were distributed to individuals rather than 
groups, performance feedback tended to mix information between the indi- 
vidual and group levels, and resources for training were low. It is possible 
that these deficiencies prevented the autonomous workgroup system from 
having its predicted direct effect on performance. 

We turn now to the practical implications. The results show that autono- 
mous group working is a viable proposition that is appreciated by those who 
experience it. Moreover, it may have clear economic benefits. This is not, as 
theory predicts, because groups enhance operators' motivation and effort; 
economic benefits stem instead from the logic of the groups themselves. 
With decision making delegated to shopfloors, the need for supervision 


5 See Clegg (1984) for an alternative theoretical account. 
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declines or vanishes, indirect labor costs decrease, and productivity benefits 
can accrue. Given appropriate organizational designs, resources, and mana- 
gerial structures, task performance might also be improved (Hackman, 1983). 
But it is also clear that such work systems have their costs, albeit ones that 
are difficult to quantify because they chiefly involve managerial time and 
effort. Furthermore, even these apparently straightforward practical implica- 
tions need to be qualified. It should not be forgotten that the findings obtained 
were from a greenfield setting. This setting had the advantage of aliowing the 
implementation of a radical and well-developed form of autonomous 
workgroups. But by the same token this setting limits generalizability. It is 
reasonable to assume that in established organizations the threat autono- 
mous workgroups pose for the nature and continued existence of supervi- 
sory jobs might reduce or outweigh their potential practical benefits. Similarly, 
the emphasis on employees having many skills may cut across agreed 
demarcation lines and customs and practices of work, particularly in union- 
ized settings. In established settings, people might resist such change. Thus, 
in such contexts, the process of change may be as important as its content. 
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APPENDIX A 


The questionnaire used in this study included the following four scales to measure attitudes 
towards work (Warr, Cook, and Wall, 1979). All items had appropriately worded 7-point response 
formats. 

The intrinsic job satisfaction scale had seven items. Examples are: 

How satisfied are you with— 

the freedom to choose your own method of working? 
the amount of responsibility you are given? 
the opportunity to use your abilities? 


The extrinsic job satisfaction scale had eight items. Examples are: 
How satisfied are you with— 

the physical work conditions? 

your rate of pay? 

the way your firm is managed? 


The intrinsic job motivation scale had six items. Examples are: 
Do you agree or disagree with the following— 


1986 Wall, Kemp, Jackson, and Clegg 303 


I take pride in doing my job as well as I can. 
My opinion of myself goes down when I do this job badly. 
I try to think of ways of doing my job effectively. 


The organizational commitment scale had nine items. Examples are: 
Do you agree or disagree with the following— i 
I am quite proud to be able to tell people.who it is I work for. 
I feel myself to be part of the organization. 
In my work I like to feel I am making some effort, not just for 
myself but for the organization as well. : 


APPENDIX B 
Statistical Analyses 


Generalized least squares. The design of this study had three main features that precluded 
the use of standard methods for analysis of variance. First, the repeated measures element of the 
panel design had to be incorporated in order to (1) increase the power of statistical tests, and (2) 
provide for the repeated measures that, if ignored, would have biased significance tests. Second, 
the growth in personnel levels at the greenfield site together with a reduction in the workforce at 
the established site necessarily resulted in a large number of missing values (3896). Finally, the 
study did not constitute a full factorial design. Autonomous workgroups were introduced in one 
site only, leading to partial confounding between site and work design factors. 

What was required was an approach that (1) capitalized on the information inherent in the 
repeated measures element, (2) used the independent groups component appropriately, and (3) 
coped with the design's lack of balance. The method of analysis chosen was based on the 
so-called cross-sectional time series designs (Hannan & Young, 1977; Nerlove, 1971). discussed 
in the econometric literature. Essentially, they give a split-plot type of ANOVA design, with a 
between-group term containing the hypotheses of interest and a within-group term that pro- 
vides for repeated measurement. 

Hannan and Young (1977) and Simonton (1977) discussed several approaches to parameter 
estimation, but the most common method is to use generalized least squares after estimating the 
autocorrelation parameter from the data. 

The first step of estimating the autocorrelation parameter was applied only to data for those 
respondents present on all three measurement occasions, with respondents represented by 
dummy variables (Hannan & Young, 1977). We used the regression weights for the respondents' 
dummy variables to calculate the autocorrelation coefficient by the formula given by Hannan 
and Young (1977: 55). We then adjusted the raw data from the whole set of respondents on the 
basis of the derived autocorrelation estimate. Finally, we ran ordinary-least-squares regressions 
on the adjusted scores for the total sample to give generalized-least-squares estimates that are 
unbiased estimates of the terms of interest and their standard errors. At this final stage, we used 
dummy variables to code site, shift, and type of work design. Measurement period was also 
included in the equation. 

One-way analysis of variance. The generalized-least-squares analysis has the advantage of 
parsimony; for each dependent variable it provides an omnibus test of relevant hypotheses 
using the whole data set with statistical control of relevant parameters. However, it does not 
provide directly for the clear examination of the specific patterns of predictions across condi- 
tions or over time. The second stage in the analysis consequently involved analysis of variance 
applied in two steps. i 

First, one-way analysis of variance for independent groups was used for each dependent 
variable at each measurement occasion separately. Where a statistically significant effect was 
obtained, we applied an a posteriori Tukey contrast test (Winer, 1971) with a prescribed signifi- 
cance level of p < .01. 

The second and final step in the analysis concerned change over time within each condition. 
We conducted both analysis of variance for repeated measures (for those respondents with valid 
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data at all measurement occasions) and analysis of variance for independent groups for each 
dependent variable. The repeated-measures analysis is potentially the more sensitive of the two, 
but has far fewer degrees of freedom because of missing values; the independent-groups analy- 
sis is more conservative since the error term is inflated by the inclusion of subject variance, but 
at the same time encompasses all respondents. We applied a posteriori contrast tests as before, 
Together, these two analyses provided a useful way of handling the data, subject to checks 
on the representativeness of the repeated-measure subsample and of distortions due to mortality 
effects in the independent-groups analysis. The Results section describes the outcome of these 
checks. f 
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A field experiment conducted with 53 hard-rock miners in an under- 
ground metal mine examined the effects of two interventions, team 
building and goal setting, on miners’ productivity and strategy develop- 
ment. The study employed a 2 x 2 factorial design with random assign- 
ment of participants to the goal setting condition, and it used objective 
criteria of performance. Unfortunately, the results were inconclusive. 
Although slight improvements occurred in some measures of perfor- 
mance and strategy development, it is not clear that these improve- 
ments were results of the interventions. More of these improvements 
occurred in the goal setting than in the team building condition. Limita- 
tions in the research design and factors in the experimental setting that 
could not be controlled led to equivocality of these results. 


Numerous techniques drawn from behavioral science for improving indi- 
vidual and crganizational performance figure in the contemporary manage- 
ment scene. Practicing managers are offered a veritable smorgasbord ranging 
from alphabet soup (QWL; OD; MBO; TA; OB Mod; Theories X, Y, Z; QC; 
PDM; etc.) to nuts. Naturally, their adherents and practitioners make claims 
for the efficacy of these techniques and counterclaims are forthcoming from 
critics and skeptics. Three reviews of the empirical literature (Cummings, 
Molloy, & Glen, 1975; Huse, 1980; Locke, Feren, McCaleb, Shaw, & Denny, 
1980) generally agreed that only a few intervention techniques have been 
shown to be effective through well-designed and well-executed research 
investigations. We believe that rigorous evaluation of applied behavioral 
science programs is desirable and that field experiments in organizational 
contexts are particularly needed. This study describes a field experiment 
that examined the effects of two prominent intervention techniques, team 
building and goal setting, on individuals’ productivity. 


TEAM BUILDING 


One of the most popular intervention techniques in organization devel- 
opment (OD) is team building (French & Bell, 1984; Porras & Berg, 1978). The 
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primary objective of team-building activities is to increase the effectiveness 
of work teams. This is usually achieved through improving problem solving 
and interpersonal relations in teams. Team building is not a well-defined 
concept, however. Organizations employ many variations; the emphases of 
programs change over time; and programs are often situation- and problem- 
specific. Beer (1976) described four models of team building: a goal-setting 
model, an interpersonal relations model, a role clarification model, and the 
managerial grid model. Dyer (1977) described a more general model of team 
building based on problem salving in which team members identify major 
problems, generate relevant information, engage in problem solving and action 
planning, and implement and evaluate action plans. We refer to this task- 
oriented problem-solving technique for teams as team building/problem 
solving. The assumption underlying this type of intervention is that work 
teams become more effective by solving their major problems together. The 
team-building program studied in this research used team building/problem 
solving techniques that are operationally defined in the Methods section. 

Research assessing the effectiveness of team building has been inconclu- 
sive (DeMeuse & Liebowitz, 1981; Nicholas, 1982; Woodman & Sherwood, 
1980). Positive results were found in 29 of the 36 studies (8096) reviewed by 
DeMeuse and Liebowitz, in 2 of the 4 studies reviewed by Nicholas, and in 
19 of the 30 studies (6396) reviewed by Woodman and Sherwood. Neverthe- 
less, these authors concluded that they could make no firm statements regard- 
ing the efficacy of team building with respect to its effects on the perfor- 
mance of individuals and work groups. They based their conclusions on the 
fact that the research designs of the studies reviewed were generally weak, 
and the findings thus lacked internal validity. Moreover, with the exception 
of some reported by Nicholas, the studies measured subjective data such as 
employees' perceptions and attitudes and usually ignored objective data on 
performance. Finally, they often vaguely and imprecisely described inter- 
ventions. 

Buller's (1982) review of the literature on team building yielded only 
nine studies that implemented clearly defined team-building interventions 
and measured effects on objective criteria of performance. Of these nine 
studies, six reported positive effects on performance. However, all but one 
(Margulies & Raia, 1968) of the successful team-building interventions were 
confounded with goal setting or other activities; thus it was impossible to 
determine which technique produced the results. Margulies and Raia's study 
used a preexperimental design and thus had limited internal validity. 

An additional problem with previous research on team building is its 
failure to specify and test relationships between independent variables, causal 
mechanisms, and criteria of performance. It is simply not clear why team 
building affects performance, if it does at all. Nicholas (1982) suggested that 
team building may lead to improved performance through ‘modification of 
attitudes, values, problem-solving techniques, and interpersonal and group 
processes” (1982: 535). A recent model presented by Buller (1983) suggested 
that team building potentially affects a number of individual, group, task, 
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and organizational factors that in turn improve performance. This model 
predicts that the performance of a team will improve to the extent that it can 
identify and solve major debilitating problems in one or more of these areas. 
However, existing models are weak in defining specific theoretical relations. 
Perhaps Hackman’s (1976) model of the effects of workgroups on the perfor- 
mance of individuals is the most relevant to this study and the most 
parsimonious in describing how team building might work. He argued that 
individual performance is a function of an individual’s ability, motivation, 
effort, and strategies for performance—all moderated by the nature of the 
task being performed. To the extent that immediate workgroups enhance or 
hinder these mechanisms, they enhance or hinder performance. Certain task 
contingencies may affect which of the causal mechanisms is most important 
to improved performance. 

This research was not an attempt to develop or validate a theory of team 
building. The primary purpose of the study was to test team building’s 
independent effects on performance using objective criteria of performance. 
However, the research directly examined the effects of team building on 
development of task performance strategies, an individual cognitive mech- 
anism. To our knowledge, no previous research has directly tested such a 
cognitive effect of team building. 


GOAL SETTING 


The importance of goal setting for motivating employees’ performance 
was first explicitly recognized in the management literature by Taylor (1911); 
Locke (1968) translated the concept into a contemporary theory of task 
performance. According to Locke’s theory, if workers accept hard, specific 
goals, performance improves. In the most thorough review of goal setting 
to date, Locke, Shaw, Saari, and Latham (1981) viewed goal setting as primar- 
ily a motivational phenomenon, but acknowledged that it also involves cog- 
nitive elements. From a motivational standpoint, goal setting affects the 
direction, amplitude (effort), and duration (persistence) of action. From a 
cognitive perspective, goal setting is thought to affect strategy development 
or ways to accomplish goals. The first three of these mechanisms are rela- 
tively direct in their effects on performance. Strategy development is more 
indirect. Locke and colleagues (1981) noted that this concept 


involves developing strategies or action plans for attaining one's 
goals. Although strategy development is motivated by goals, the 
mechanism itself is cognitive in essence; it involves skill devel- 
opment or creative problem solving (1981: 132). 


Goal setting, the subject of extensive experimental research in both labo- 
ratories and field settings, has been found to be a robust technique for improv- 
ing task performance (Latham & Yukl, 1975; Locke et al., 1981). Most of this 
research has focused on the first three mechanisms by which goals affect 
performance. Few studies have directly examined the effects of goal setting 
on strategy development, although several studies have inferred such effects. 
For example, Bandura and Simon (1977) noted that overweight clients in a 
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weight clinic who had specific goals for mouthfuls eaten altered their eating 
habits. Some individuals ate low calorie foods that did not count in their 
quotas; some engaged in more planning than before. Latham and Baldes 
(1975) observed similar behavior in a study of drivers of logging trucks. 
Some of the drivers who were assigned specific challenging goals for trucks’ 
weights recommended small changes in their trucks that let them more 
accurately judge weight. In general, participants in these studies with hard 
specific goals engaged in more creative problem solving than participants 
with no goals. 

In another relevant study, Terborg (1976) obtained separate measures of 
the effects of goal setting on individuals’ efforts and the direction of their 
behavior on a reading task in a simulated organization. The study operation- 
ally defined direction of behavior as the extent to which individuals employed 
appropriate behaviors or action plans to accomplish the task. This was essen- 
tially a measure of strategy development. The results showed that goal set- 
ting was related to both effort and direction of behavior, which in turn were 
related to performance. Specifically, individuals who set hard specific goals 
on the reading task spent more time on the reading materials (effort) and 
were more likely to use learning strategies relevant to their goals (direction 
of behavior) than those who did not set goals. 

Finally, scholars have observed that strategy development may be espe- 
cially important in complex tasks because if people do not develop strategies 
that are relevant to goals, increased motivation caused by goal setting may 
not enhance performance (Bavelas & Lee, 1978; Christensen-Szalanski, 1980). 
The nature of tasks may therefore be an important variable determining 
whether or not goal setting will stimulate development of strategies, lead to 
improved performance, or both. 

The studies discussed in the preceding paragraphs support conceptual 
arguments that both cognitive and motivational mechanisms influence per- 
formance of tasks (Naylor, Pritchard, & Ilgen, 1980; Terborg & Miller, 1978). 
They provide some evidence that goal setting affects cognitive as well as 
motivational determinants of performance. To date, only one reported study 
has systematically examined the effects of goal setting on strategy develop- 
ment. In a laboratory experiment, Shaw (1983) found that specific hard goals 
led to the development of more task strategies than no goals did. The present 
study investigated the effects of goal setting on strategy development and 
task performance in a field setting. 

In summary, our hypotheses were straightforward: team building and 
goal setting will each have significant, positive effects on task performance 
and strategy development. 


METHODS 
Context and Setting 


The present research was part of a larger organization development (OD) 
project designed to demonstrate the effectiveness of OD techniques for improv- 
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ing safety and productivity in mines. Conducted at a silver mining company, 
the program consisted primarily of team-building activities at all levels of 
the organization but also involved a variety of other activities. These included 
a revision of the company’s performance appraisal system to place more 
emphasis on safety, an assessment of data processing needs, development of 
a program for training supervisors, and the experiment reported in this paper. 
_ Although the other activities potentially confounded the effects of the 
experiment, it is important to note that only the supervisory training pro- 
pram also directly involved hourly workers. Further, the organization did 
not implement this training program until after we had completed our study. 
The entire OD project took approximately three years; the field experiment 
extended over eight months. 8 

The site for the field experiment was an underground mine producing 
silver, lead, and zinc. The mine employed 292 persons with approximately 
220 hourly employees involved in underground production operations. The 
average hourly employee was 38 years old, had a twelfth grade education, 
and had 12 years of work experience. About 60 percent of the workforce 
belonged to an international union. The mine was a hard-rock mine in which 
ore-bearing rock in vein structures was mined in small working places called 
stopes. 

As part of the overall OD program, a team-building intervention was 
conducted with approximately half of the hourly underground employees 
and their supervisors at the mine. Adding the goal-setting induction gener- 
ated a 2 X 2 factorial design and thereby allowed the controlled investigation 
of both behavioral science interventions in a single field experiment. Thus, 
the experiment was a study within a study and was more limited in scope 
than the overall OD effort. To our knowledge no one has previously imple- 
mented and evaluated either team building or goal setting in the mining 
industry. Additionally, few studies of team building have been conducted 
with hourly employees as participants. 


Study Design 


The units of analysis for this study were stopes, small underground 
working places in the mine. Twenty of the mine's 23 production stopes were 
assigned to the treatment conditions; 3 stopes were excluded because they 
were not operational during the course of the experiment. To implement the 
team-building condition, the mine superintendent nonrandomly assigned 12 
stopes consisting of 36 miners to undergo team building; the 8 remaining 
stopes were assigned to the no-team-building condition. The reasons the 12 
stopes were assigned to team building were: (1) four intact work crews oper- 
ated them, each crew consisting of a shift boss (first-line supervisor), stope 
miners, and hourly support personnel; (2) they constituted two complete 
production levels underground; (3) they included opposite shift crews, with 
one crew working the day shift and the other crew working the afternoon 
shift on the same level, and the superintendent wanted to address problems 
that existed between opposite shifts; (4) the locations of these stopes were 
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important to overall mine production; and (5) the superintendent wanted at 
least half of the hourly workforce to get an exposure to team building. 

Two weeks after the team-building intervention was completed, the goal- 
setting condition was implemented by randomly assigning 10 stopes consist- 
ing of 26 miners to the goal-setting condition and 10 stopes consisting of 27 
miners to the no-goal-setting condition. Thus, a total of 53 miners, all men, in 
20 production stopes were assigned to experimental conditions as shown in 
Figure 1. The resulting 2 x 2 factorial design was quasi-experimental because 
it lacked complete random assignment to conditions and simultaneous induc- 
tion of the two treatments (Cook & Campbell, 1979). Figure 2 shows the 
timeline for the experiment. 

Two miners worked most stopes, but some stopes had one, three, or four 
individuals. Since it was operationally impossible to determine every miner's 
individual performance, we used performance of stope teams as the unit of 
analysis. Clearly, this situation presents problems in distinguishing between 
the performance of individuals and groups. ` 

The stope miners' task. Stope miners are the primary production work- 
ers and major source of increases or decreases in productivity in such mines. 
Typically, two miners work in each stope for long periods of time, working 
both alone and together. Each mines a horizontal cut about 100 feet long in 
opposite directions from the raise, a vertical opening on the vein. All other 
times they help each other in certain phases of the mining cycle. Miners 
share equal responsibility for a stope's production. As the rock is mined out, 
they pump a sand slurry back in to raise the floor of the stope and the mining 
cycle of “drill, blast, and muck’! begins again. All miners use similar equip- 


FIGURE 1 
Factorial Design for the Experiment 


Goal Setting No Goal Setting 


Team Building n = 6 stopes n — 6 stopes 
(18 miners) (18 miners) 


No Team Building n = 4 stopes n — 4 stopes 
(8 miners) (9 miners) 
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ment and mining methods. A crew of support people assists the stope miners 
by bringing necessary supplies to the work area and helping to remove the . 
ore. Each level of the mine has from one to six working stopes, and one 
support crew serves all stopes on a level. 


Contact between Researchers and Participants 


Sponsored by the U.S. Bureau of Mines as a consultant for the three-year 
OD project, the second author conducted all of the team-building sessions 
described later in this section. The first author, also sponsored by the Bureau 
of Mines, was a research assistant during one year of the OD project. As a 
research assistant, he observed but did not participate in the team-building 
sessions with underground crews, conducted the goal-setting intervention, 
and collected and analyzed the data reported here. 


Independent Variables 


The experimental treatments were team building/problem solving and 
goal setting. Each variable had two conditions, the presence or absence of the 
treatment. 

Team building/problem solving. This variable was operationally defined 
as a planned series of meetings, facilitated by an outside consultant, with a 
team—a group of people having common organizational relationships and 
goals. These meetings must be designed specifically to improve the team’s 
accomplishment of tasks by (1) developing problem-solving procedures and 
skills and (2) then solving the team’s major problems. 

The team-building induction was implemented through a series of six 
45-minute meetings over a period of ten weeks. Entire crews, including shift 
bosses, attended meetings; groups included from 12 to 20 individuals. The 
team-building sessions were task-centered; they focused on surfacing and 
solving problems that got in the way of working effectively, efficiently, and 
safely. The second author facilitated meetings as an outside consultant and 
helped the crews to bring problems related to their task to the surface. These 
groups identified task-related problems, generated ideas concerning the causes 
of the problems, solved problems and planned actions, and implemented 
and evaluated plans. 

It should be noted that the team-building manipulation in this study was 
a global intervention. By definition, team building works with problems that 
teams identify. The specific problems addressed differed somewhat among 
the four crews. However, the basic process was the same; all team-building 
activities contained elements of the operational definition. They were (1) 
planned, (2) facilitated by an outside consultant, (3) engaged in by intact 
workgroups, (4) designed to develop the problem-solving capability of the 
crews, and (5) intended to solve major problems. 

Description of specific team-building activities. Team building for each 
crew began with the consultant providing information on the background 


1 To “muck” means to remove the broken rock. 
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and purpose of the meetings. It was emphasized that the meetings’ primary 
purpose was to improve the efficiency and safety of underground operations 
and to help make the mine a better place in which to work. Participants were 
told that all information gathered in the meetings would be used to generate 
constructive ideas for improvements and that their active participation was 
voluntary but welcome. The consultant then raised three specific questions: 
(1) How can we do the job better? (2) How can we make this a better place to 
work? (3) How can we make this a safer place to work? Focusing on these 
questions, crew members interacted with their shift bosses, with one another, 
and with the consultant. The consultant publicly recorded major points from 
the discussions on flip charts. He also condensed the problems, issues, and 
solutions from each meeting into major themes and fed them back to the 
crews via handouts at the beginning of each subsequent meeting. The mine's 
management also got a record of major themes for diagnosis of problems and 
planning of action. 

Four major issues emerged from the team-building sessions: (1) poor 
communication between opposite shift crews and between miners and mine 
management; (2) poor working conditions underground in terms of both 
safety and performance; (3) poor repair and maintenance of equipment; and 
(4) pay and bid systems that were perceived as unfair. The crews' discus- 
sions generated a variety of suggestions for addressing these problems. The 
crews implemented some solutions directly. For example, in an effort to 
improve communications between opposite shifts, they scheduled daily meet- 
ings between opposite shift crews; none had existed before. The consultant 
facilitated several of these meetings between crews. Crews also acted on 
some suggestions involving underground working conditions, such as stor- 
ing timber in safer, more accessible areas, cleaning up misplaced tools and 
materials, improving repair of tracks, and taking better care of drilling 
equipment. 

During the fifth of the series of team-building meetings, the mine man- 
ager and superintendent interacted with the crews on issues that directly 
involved management of the mine and were beyond the decision making 
scope of the crews. Some of the problems were acted on immediately. For 
example, management freed some resources to create additional storage space 
underground. Some problems, primarily those dealing with maintenance 
and the pay system, were acknowledged as important and placed under study. 
Other issues, most notably the need for larger facilities and parking areas, 
were postponed for later resolution, since the mine had plans for future 
expansion. As management subsequently acted on problems, they made those 
actions known to the crews via the meetings and notices posted on a bulletin 
board. 

Goal setting. This variable was operationally defined as the setting of 
specific, difficult, but attainable production goals and the receiving of weekly 
feedback on performance in relation to those goals. Performance feedback 
was an intergral part of the manipulation because previous research has 
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found feedback to be a necessary condition if goals are to influence perfor- 
mance (Locke et al., 1981). 

The goal-setting induction was implemented as follows. Two weeks 
prior to actual goal setting with stope miners, those shift bosses with miners 
assigned to the goal-setting condition received one hour of training, con- 
ducted by the first author, on setting goals and giving feedback on perfor- 
mance. Next, the goal-setting induction occurred in a one-hour meeting, also 
conducted by the first author, in which stope miners and shift bosses received 
graphs depicting the past 18 months of production in their stopes. They were 
asked to set specific, difficult, but attainable goals for tons per manshift and 
grade of ore? for the next three months. Goals were set participatively, with 
the miners and their bosses establishing mutually agreed upon goals that met 
the stated criteria. After goals were set the researchers recorded them on a 
feedback sheet and gave copies to each miner and shift boss. The feedback 
sheets, which were updated each week, included (1) graphs that displayed 
the recent output of that individual’s stope in terms of tons per manshift and 
grade of ore, and (2) a statement that gave the goals that had been agreed on 
for the three-month trial period and described the particular stope’s average 
output to date. Shift bosses were instructed to present feedback sheets to 
miners in the goal-setting condition weekly. Direct observations and post- 
experimental interviews confirmed that shift bosses did provide this regular 
feedback. 


Dependent Variables 


Two productivity measures were generated, one for quantity of produc- 
tion and one for quality of production. Tons per manshift is a standard 
measure of productivity in mines. It measures the amount of rock broken and 
removed from a stope per person per eight-hour shift. We used this variable, 
which was available from mine records every two weeks, as a measure of 
quantity of production. During the eight-month experimental period, 16 mea- 
sures of tons per manshift were taken; the first two months of the period 
provided baseline data. 

Grade of ore is also a standard measure of productivity in mines. It 
measures the numbers of ounces of silver and lead per ton of ore taken out of 
a stope. This variable, available from mine records monthly, was used as a 
measure of the quality of production. Eight measures of grade of ore were 
taken during the experimental period, including two measurements of base- 
line data. 

A third dependent variable was strategy development, operationally 
defined as changes in miners’ behaviors related to improving grade of ore. A 
document known as the grade strategy checklist provided a measure of grades. 
It listed seven specific observable behaviors related to grade improvement 
that stope miners could control. The mine geologist and the first author 
developed the checklist, which contained the following items: controlling 
the height and width of the stope, mining with the ore vein, putting in 


2 These two measures are defined in the section on dependent variables. 
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sufficient ground support, using proper slushing techniques, cleaning out 
rock carefully, and accurately locating the brow for a new cut. The mine 
geologist, who was blind to experimental condition, completed the checklist 
twice for each stope—once before the first goal-setting induction and once 
three months later. He rated each behavior according to the following 
scheme: good = 15; average = 10; below average = 5. A total of 105 points was 
a perfect score. The grade strategy checklist represented an unobtrusive 
measure of the extent to which miners were engaging in appropriate be- 
haviors to improve grade of ore. It further served as a direct test of the hy- 
pothesis that goal setting and team building lead to the development of 
strategies. 

Measures of goal acceptance and goal difficulty were obtained after the 
goal-setting inductions to check for any differences among conditions on 
these variables. Following Latham and Steele (1983), we measured goal accep- 
tance and goal difficulty with one face-valid question each; participants 
responded to these questions on a 5-point Likert-type format. 


RESULTS 
Productivity 


Table 1 summarizes the means for all three productivity variables before 
and after the experimental manipulations. Figures 2, 3, and 4 display plots of 
the dependent measures over time. 


TABLE 1 
Summary of Means for Tons per Manshift, Grade of Silver, 
and Grade of Lead before and after the Interventions 





BeforeInterventions? After Interventions? 


Dependent Experimental SEN 
Variables Conditions Means s.d. Means s.d. 
Tons per manshift Team building and 
goal setting 14.52 5.63 13.73 5.41 
Goal setting 19.32 8.28 23.16 12.51 
Team building 13.23 2.28 13.22 3.41 
Control group 8.38 3.43 8.91 3.91 
Grade of silver Team building and 
(ounce of silver goal setting 11.08 1.43 15.24 4.01 
per ton of ore) Goal setting 19.95 5.80 20.20 5.85 
Team building 10.75 1.06 14.30 0.45 
Control group 16.07 0.47 32.42 5.97 
Grade of lead Team building and 
(percent of lead goal setting 7.36 1.01 7.58 1.05 
per ton of ore) Goal setting 6.31 0.74 7.13 2.00 
Team building 6.98 0.64 9.33 0.79 
Control group 5.92 1.48 4.37 1.30 


® Means shown are averages of the dependent measures taken during the three months pre- 
ceding the interventions. 
> Means shown are the averages of the dependent measures taken during the three months 
following the interventions. 


» 


315 


Buller and Bell 


1986 


(syaom omy Azada peyjo[d eq} 
Spo ur ani, 








dno18 [oruor) 
mo Supppnq uwag, +teeree 
Áquo ZUPAS [BOL] ae ow 
sures [eod ` 

pue Sutppmq wea, 


uonoejoo 
Byep jo pus 








uononpur Surjas e09 Surp[mq urge] ~ 


uonrpuo;) jejuourrredx Aq aul], J9A0 poyjo[d jjirusuejq sed suo, 
z 29914. 


reet Jad suog 


Academy of Management Journal June ` 


316 


uoroer[oo 


Step jo pui 


{puou Arana payotd ead) 
Stout UT ot. 





ELETE TI 
sabas enm ee 
n Dë ge e e. Zesneenee enee 
Ten a men pe gn 





x 


dnoi8 [ouo ae 

Áquo Surppmq UWE], en 

ATWO BUTIOS Teen anne 
Buyyes peog l 

pue Supppnq weap ——-— 





uoronpur 8urjes peos) Srptng urea, 


uornipuo) [pjueumn1aedxq Aq oui, 12A0 pajjo[qd J9AUS Jo aper) 
E INIA l 


{ə10 Jo Gouigaite jo səvuno) 


IGA[IS JO epeIg 


317 


Buller and Bell 


1986 


(muou &1eA8 payojd ee) 
SUJUOUL UT aui 


dnoi8 [orjuo  — me 
Áquo Song weap «(ee 
.Áq[uo BUNS [LOY -——- 
Suns [eos z 
pue Surpying wear 


t.ro g - 
cue Merz 

not Mn 
PD Pd 


. H H ` 8 


DH 
D 
Aneren rerf 





uonoemoo 
pp jo pug 





uononput Suas Teor) Surppmq urea r, 


uonipuo; [ejueurzadxq Aq oumy 1940 payo[qd peor Jo apeayy 
P INDIA l 


{a10 jo goupeet jo %) 


pea] jo aperin 


318 Academy of Management Journal June 


Note that in Table 1 the largest increase in the mean grade of silver was 
in the control group. However, one stope accounted for 88 percent of this 
increase. In examining these results, the mine geologist stated that the large 
increase in grade in this stope resulted from a change in geological condi- 
tions rather than from improved performance of miners. We therefore con- 
ducted all of the following analyses with and without the data from this 
stope but found no appreciable differences between the results of the two 
analyses. 

Since complete random assignment to experimental conditions was not 
possible, we followed procedures for analyzing quasi-experimental data sug- 
gested by Cook and Campbell (1979). First, we performed an analysis of 
variance on the productivity measures for the two-month baseline period 
preceding the experimental manipulations to determine if any significant 
differences existed among miners in the various conditions. There were no 
significant differences between conditions on pretest measures of tons per 
manshift and grade of lead, but there was a significant main effect for team 
building on grade of silver (F1,16 = 4.28, p<.05). In addition, we conducted 
an analysis of variance on measures of goal acceptance and goal difficulty to 
determine if the experimental conditions differed on these variables. No 
significant differences were found. 

Next, an analysis of variance with repeated measures was performed for 
each of the three productivity variables using measures covering the eight- 
month experimental period. These results are given in Table 2. They show 
that there was a significant difference between stopes in the goal-setting and 

‘no-goal-setting conditions on tons per manshift but not on grades of silver or 
lead. There was a marginally significant difference between the team-building 
and no-team-building conditions on grade of silver but not on tons per 
manshift or grade of lead. There was a marginally significant interaction 
between team building and goal setting on tons per manshift. Additionally, 
there was a significant time effect of the two interventions on tons per manshift 
and a marginally significant time effect on grade of silver. Finally, we found 
a marginally significant team X goal X time interaction on grade of silver. We 
found no significant differences among any of the conditions on grade of 
lead. 

The results of the analysis of variance with repeated measures are, 
unfortunately, somewhat equivocal. We found no significant difference among 
conditions on preexperimental measures of tons per manshift, but found a 
significant difference between the goal-setting and no-goal-setting condi- 
tions over the course of the experiment. This finding, in conjunction with 
the results summarized in Figure 2, provides some support for a positive 
effect for goal setting. Yet, we cannot draw an unambiguous conclusion for 
two reasons. First, even though the difference before the experiment between 
the goal-setting and no-goal setting conditions on tons per manshift was not 
significant, it was substantial and could by itself conceivably account for the 
results. Second, the small sizes of cells along with considerable variability in 
dependent measures could have produced spurious results. 
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The meaning of the marginally significant difference between the team- 
building and no-team-building conditions on grade of silver is also unclear. 
There was a significant difference between these conditions on grade of 
silver before the intervention, with miners assigned to the no-team-building 
condition apparently outperforming those assigned to the team-building 
condition. This initial difference between the two conditions did not change 
during the course of the experiment. However, we cannot conclude that the 
absence of team building led to greater improvement of performance than 
the presence of team building. In addition to small cell sizes and high vari- 
ability in the data, the lack of random assignment to team-building condi- 
tions prohibits firm conclusions. It is possible that other factors, such as 
history, selection-maturation interactions, or other threats to internal valid- 
ity caused the difference found between conditions (Campbell & Stanley, 
1963). 

Next, an analysis of variance with gain scores was conducted. This 
procedure, suggested by Reichardt (1979), examines mean changes in perfor- 
mance between treatment groups from pretest to posttest. It is a useful tech- 
nique for analyzing data from nonequivalent group designs because, like 
analysis of covariance, it helps to control for any pretest differences between 
groups. Analysis of gain scores was more appropriate than ANCOVA with 
our data since the pretest differences between treatment conditions were 
substantial. Unlike the ANCOVA method, the precision of analysis of gain 
scores is not related to the size of pretest differences (Reichardt, 1979). The 
procedure employed a 2 x 2 analysis of variance with goal setting and team 
building as independent variables and scores for changes in productivity as 
the dependent variables. Changes were measured both as absolutes and as 
percentages. We conducted the analysis both with and without data from the 
unusual stope in the control group. No significant main or interaction effects 
were found, indicating that the treatments had no effect. But again we can- 
not draw firm conclusions because, as Reichardt (1979) pointed out, analysis 
of gain scores is subject to other problems—bias from selection-maturation 
interactions, floor and ceiling effects, and regression effects—threats to valid- 
ity that simply cannot be ruled out. ` 

To analyze the effects of treatments further, we assessed changes within 
each experimental condition before and after the inductions. Using time as 
an independent variable with two levels—average performance measures both 
before and after interventions—we conducted a one-way analysis of variance 
on tons per manshift, grade of silver, and grade of lead for each of the four 
experimental conditions. The results showed that the goal-setting-only con- 
dition had a marginally significant effect on tons per manshift (F, 4 = 7.1, 
p«.08); results for no other conditions approached significance. The com- 
bined team building-—goal setting condition had a marginally significant effect 
on grade of silver (F4,5 = 4.5, p<.09) and so did the team-building-only 
condition (F5 = 4.9, p<.08). The team-building-only condition had a mar- 
ginally significant effect on grade of lead (F1, = 4.7, p<.08), with no other 
conditions approaching significance. Although the results of these within- 
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group analyses are only marginally significant, they provide some evidence 
for some treatment effects. Specifically, team building showed a positive 
effect on grades of silver and lead, and goal setting appeared to improve tons 
per manshift. Again, since the treatment groups were nonequivalent and the 
cell sizes small, we cannot entirely rule out factors other than the treatments 
as explanations of the results obtained. 

In summary, results of analyses of the data on productivity were not 
conclusive. The weight of evidence, however, suggests that the treatments 
produced little or no effect on performance. Although there is some evidence 
consistent with a positive but small effect for goal setting on tons per manshift, 
tival explanations of this result could not be ruled out entirely. There was 
also some evidence for a positive effect for team building on grade of ore, but 
this evidence is open to even more questions. 


Strategy Development 


An analysis of variance on pretest scores on the grade strategy checklists 
showed no significant differences among experimental conditions. An analy- 
sis of variance with repeated measures was then performed on pretest and 
posttest scores on grade strategy checklists. We found no significant main 
effects for either team building or goal setting. There was, however, a main 
effect for time (E: ue = 19.7, p<.001), indicating that miners improved grade 
strategy across conditions over time. When we submitted scores from grade 
strategy checklists to an ANOVA with the four experimental conditions as 
separate treatments and time as the independent variable, there was a signifi- 
cant effect over time for the team-building-only condition (F45 = 14.3, 
p<.01), and marginally significant effects for the goal-setting-only (F41, = 
7.7, p<.07), and the combined team building-goal setting condition (F4,5 = 
4.8, p<.08). The control group showed no significant improvement on scores 
for grade strategy. This within-group analysis suggests that the major gains 
in strategy development over time were achieved in the experimental treat- 
ment conditions. However, when an analysis of variance using gain scores 
was conducted, results showed no significant differences among conditions. 
Table 3 summarizes the mean scores on the grade strategy checklist for 
experimental conditions before and after the experimental manipulations. 


Qualitative Assessment of the Interventions 


Although it was not the primary emphasis of this study, we also assessed 
participants’ reactions to the interventions. After the interventions, the first 
author conducted structured interviews with three randomly selected shift 
bosses and ten randomly selected hourly employees; these interviews pro- 
vided data for this evaluation. In addition, these individuals completed a 
brief postexperimental questionnaire pertaining to the specific problems 
identified by hourly employees in the team-building sessions. 

In general, the structured interviews revealed that miners liked both 
interventions. With respect to team building, most miners thought that the 
intervention allowed them to express their ideas to management and get 
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TABLE 3 
Summary of Mean Scores on the Grade Strategy Checklist? 
before and after the Interventions 


Experimental Conditions Before Interventions After Interventions 
Team building and goal setting 55.00 70.00 
Goal setting 70.00 85.00 
Team building 65.83 80.00 
Control group 62.50 66.25 


a Maximum possible score on the grade strategy checklist was 105. 


things out in the open. Several employees observed that there seemed to be 
better relations between management and the union as a result of the 
intervention. Specific comments included: “The meetings brought a lot of 
stuff out, got problems out into the open," "The guys felt better identifying 
problems," and “The meetings helped us to talk better with the opposite 
shift." But several hourly employees mentioned a shortcoming that team 
building had: nothing seemed to change much as a result of the meetings. 
Others noted a lack of interest on the part of the hourly employees and that it 
was a nuisance to come to the meetings. Some representative comments 
were: “Nothing really happened as a result of the meetings," and "Td like to 
see the people take a bigger part; if more would have talked up, things would 
have been better, more things would have changed." 

Those miners involved in the goal-setting intervention most often men- 
tioned regular feedback on their productivity and the grades they were obtain- 
ing as a valuable part of the program. Representative statements were: “Goal 
setting made us more conscious of what we are doing," and “It showed me 
how I can make a little better money—set standards for myself." However, 
two miners expressed suspicion regarding goal setting: "I thought that the 
goals would be used to set day's pay and incentive rates."' 

As noted in the Methods section, the team-building sessions identified 
several specific problems: communication between opposite shift crews; 
communication among underground management personnel; politics and 
favoritism in pay and bid systems; equipment and maintenance; and working 
aud mine conditions. We administered a brief postexperimental question- 
naire to determine these shift bosses' and employees' perceptions of the 
effects of team building on these problems. The questionnaire simply asked: 
“For each of the following problems listed, please indicate if the problem is 

-asaresult of the crew meetings"; the response format ranged from 1 = 
much worse to 5 — much better. Results indicated that workers perceived 
improvement in performance in all of the problem areas. The largest per- 
ceived improvement was in communication between opposite shift crews (X = 
4.07), followed by politics and favoritism (X = 3.92), communications among 
underground management personnel (X — 3.85), equipment and maintenance 
(X — 3.46), and working and mine conditions (X — 3.23). 
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To summarize, the data obtained through structured interviews indi- 
cated that shift bosses and hourly employees generally liked and accepted 
the interventions, although we noted some negative reactions. Further, results 
from the questionnaire showed that employees perceived team building as 
having improved the specific problems it had identified. Obviously, there 
are problems with the validity and reliability of these qualitative findings. 
However, they provide some additional evidence consistent with positive 
outcomes of the interventions. 


DISCUSSION 


Although our design was an improvement over past research in many 
respects, the results of this field experiment were not conclusive. Even 
when results are in the predicted direction, we cannot say with certainty 
that goal setting or team building produced these effects in performance or 
strategy development. The shortcomings of this experiment illustrate the 
many difficulties encountered in doing field research. 

The main problem in interpreting our results is the fact that we could 
not ensure that experimental and control groups were equivalent. Differ- 
ences between groups owing to selection, maturation, or instrumentation 
could therefore have influenced the findings. We tried several, but cannot 
be certain that any statistical procedure we employed accounted for all po- 
tential biases in our results. As Cochran and Rubin (1973) pointed out, “If 
randomization is absent, it is virtually impossible in many practical circum- 
stances to be convinced that the estimates of the treatments are in fact 
unbiased” (1973: 417}. Cook and Campbell (1979) suggested that, in interpre- 
ting the results of such quasi experiments, it is important to consider their 
consistency with previous research. The findings of previous research on 
goal setting are quite clear: hard, specific goals, if accepted, lead to improved 
task performance. The results of this experiment, although in the positive 
direction, are not as clear or dramatic as those of previous studies. 

Several factors relating to the research design and setting may help to 
explain our relatively weak results. First, the small number of stopes in the 
analysis was a limitation, as was the high degree of variability in the perfor- 
mance data. Both limitations make it difficult to detect systematic variance. 
In addition, the experiment was conducted with unionized hard-rock miners— 
subjects who were skeptical of modern management techniques. As two 
employees remarked in the postexperimental interviews, there was some 
concern that the goal-setting intervention was an attempt by management to 
boost productivity and then adjust the incentive pay scales downward. If 
this sentiment was widespread, it may have caused some miners to restrict 
their output. Although miners in the goal-setting condition indicated a high 
level of goal acceptance, we cannot be certain that this acceptance was real 
because of problems with the measure of acceptance used in the study. 
Specifically, our measure of goal acceptance consisted of a single, face-valid 
question whose validity has been questioned because goal-setting partici- 
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pants tend to respond positively to it with little variance in their responses 
(Locke et al., 1981). And, even though employees expressed generally posi- 
tive reactions to the goal-setting intervention in the structured interviews, 
reactive effects of the interview itself may have colored their responses. 
Thus, the weak results may simply reflect miners’ relatively weak accep- 
tance of or commitment to the goals set. Finally, the nature of the miners’ 
task and working environment may have contributed to the minimal results. 
Hard-rock mining is a difficult task in an uncertain environment. Although 
the performance of miners is a function of their ability and motivation just as 
it is in other jobs, events beyond their control can influence their output, 
perhaps more so than in many other blue-collar jobs. In such an environment, 
it is possible that increases in miners’ efforts and strategy development did 
not translate into improved performance. 

Yet, these limitations of the study are also its strengths. Previous research 
on goal setting has been conducted primarily in laboratories under con- 
trolled conditions. Those field experiments that have been conducted have 
generally had nonunionized participants performing relatively simple, rou- 
tine tasks. As others have pointed out, problems may arise in applying goal 
setting in situations in which tasks are complex, unstructured, or interde- 
pendent (Baumler, 1971; Latham & Yukl, 1975; Locke & Latham, 1984). The 
present study suggests that the effects of goal setting on performance may be 
muted under such conditions. Further research in similar settings is needed 
to continue to assess the degree to which goal-setting effects can be 
generalized. 

Also, previous research examining the effects of team building on perfor- 
mance has been inconclusive, largely because of poor research designs and 
reliance on subjective measures. The present study, employing a more rigor- 
ous design and objective measures, also produced ambiguous results. 
Although a modest improvement in grade of ore occurred in the team-building 
conditions, we cannot be certain why it occurred. Several factors, in addi- 
tion to those cited above, may have contributed to the results obtained. First, the 
team-building intervention was more diffuse than the goal-setting interven- 
tion. Although increased effectiveness in production was one of the objec- 
tives of team building, the sessions did not focus specifically on increasing 
tons per manshift and quality of ore. The broad nature of this intervention 
could have reduced its impact on these output variables, even though miners 
perceived improvements on the specific problems identified by team building. 
Related to this issue is the fact that we evaluated performance for only 
three-and-a-half months following the intervention. Because team building 
involves rather complex and relatively slow processes of developmental 
change (French & Bell, 1984; Porras & Berg, 1978), our analysis may not have 
captured the full effects of the intervention. A longitudinal assessment is 
naw being conducted to examine long-term effects. 

Two final problems in interpreting the results on productivity concern 
possible selectivity bias and reactive effects of the experiment. The mine 
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superintendent made the nonrandom assignment of stopes to the team- 
building condition primarily because those stopes were on the two major 
production levels of the mine. And, although nothing suggests that miners in 
the team-building condition differed from those in the non-team-building 
condition, the geological conditions of the stopes differed between these 
treatment groups. Stopes in the non-team-building group were primarily at 
high elevations in the mine, where the rock is harder than at low elevations. 
Generally, the harder the rock, the faster miners can drill, blast, and muck. 
The non-team-building stopes also apparently had richer veins. These differ- 
ent geological conditions may explain the preexperimental differences 
between treatment groups. The geological differences between stopes were 
impossible to control for and make it difficult to compare effects on perfor- 
mance between treatment groups. Under these conditions, the results of the 
analyses within treatment groups, which examined changes in performance 
over time, are the most informative. 

The reactive effects of the experiment may also have caused problems. 
As mentioned, employees in this experiment were not used to being studied 
by people other than “efficiency experts.” Although there is no strong 
evidence, some miners may have suppressed their output out of suspicion of 
the researchers’ motives. On the other hand, the modest performance effects 
found may have been due to miners’ increased motivation resulting from the 
so-called Hawthorne effect. The structured interviews and informal observa- 
tions revealed that many miners felt positive about, and even enjoyed, all the 
attention from the researchers and mine management. These possible Haw- 
thorne effects do not overly concern us because one of the causal elements of 
most behavioral science approaches to improving productivity, particularly 
an approach like team building, is increased attention and focus on employ- 
ees and their work. In fact, we argue that the objective of many such interven- 
tions is a sustained Hawthorne effect. 

Finally, we cannot entirely rule out the possibility that competition 
between groups or diffusion of the treatments between groups affected the 
results. The interventions were public in the sense that employees knew 
who received team building or goal setting. Although there is no evidence to 
suggest this, it is possible that rivalry could have emerged between the haves 
and the have-nots. Diffusion of the interventions also could have occurred, 
but this was not very likely, particularly in the team-building condition. 
Miners in the control group could have set personal goals as a result of the 
intervention, but there is no evidence that they did. Only those miners in the 
goal-setting condition received weekly performance-feedback graphs on tons 
per manshift and grade of ore to minimize possible contamination of the 
no-goal-setting groups. However, all miners received bimonthly information 
on tons per manshift on their paychecks; they did not, however, get feedback 
on grade of ore. Thus, miners in the goal-setting condition received more 
frequent and specific feedback than other miners. 

The present research produced some information, consistent with previ- 
ous research, regarding the cognitive effects of goal setting. Miners who set 
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goals and received periodic feedback appeared to develop and apply better 
strategies for improving the grade of ore they produced than miners in the 
control group. Again, the limitations cited for the productivity variables 
make firm conclusions unwarranted. It is also important to note that any 
improvements in grade strategy owing to the effects of goal setting were not 
translated into increased grade quality. The weak effects of strategy develop- 
ment or conditions beyond the miners' control affecting grade of ore may 
have been responsible for this lack. The study suggests, however, that cogni- 
tive mechanisms as well as motivational mechanisms can affect the relation- 
ship between goal setting and task performance. Employees who set specific 
hard goals and receive feedback on them may not only work harder than 
employees who do not set goals and receive feedback—they may also develop 
more appropriate ways to accomplish task goals. Team building also appeared 
to have a positive effect on strategy development, consistent with Hackman’s 
(1976) model. And, even though strong conclusions cannot be drawn, it is 
interesting to note that team building may have influenced the development 
of strategies which in turn may have improved quality of grade. 

In summary, although the findings of this field experiment are unclear, 
the results are important for several reasons. First, the study raises a question 
about the extent to which goal-setting effects are generalizable to union 
employees working on complex and uncertain tasks. Further research under 
similar conditions is warranted. Second, the study suggests that goal setting 
and team building may affect cognitive mechanisms such as strategy 
development. Future research should continue to examine this mechanism 
using more refined measures. Third, the study points to our continuing lack 
of understanding of how team building influences performance. We attempted 
to operationally define this intervention more clearly and to examine its 
effects more systematically than previous research has done, and yet its 
effects remain elusive. Further research should attempt to document team- 
building activities and possible intervening mechanisms more precisely and 
examine their relation to outcome variables. Perhaps path analysis or other 
causal analysis techniques (James, Muliak, & Brett, 1982) might provide 
useful means for assessing the multiple relationships generated by the team- 
building intervention. Finally, declining productivity continues to be a major. 
problem in underground mines and other organizations, and managers con- 
tinue to be beseiged with a large number of quick-fix solutions from behav- 
ioral scientists and others. Unfortunately, many approaches to improving 
employees' productivity remain untested. The present research, by provid- 
ing a systematic evaluation of two specific techniques for improving the 
performance of employees, provides managers with a more objective basis 
on which to judge the usefulness of team building and goal setting. 
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ENVIRONMENTS AS MODERATORS 
OF THE RELATIONSHIP 
BETWEEN STRATEGY AND PERFORMANCE 


JOHN E. PRESCOTT 
University of Pittsburgh 


Environments have been viewed as key contingency variables for rela- 
tionships between strategy and performance. Yet, the nature of this 
contingency relationship has not been resolved; do environments mod- 
erate its form or its strength? Using moderated regression analysis and 
subgroup analysis, this study demonstrated that environments, as mea- 
sured by characteristics of market structures, moderate the strength but 
not the form of relationships between strategy variables and per- 
formance. 


Both a business' environment and its strategy have been hypothesized 
and empirically demonstrated to have significant effects on performance 
(Porter, 1980; Scherer, 1980). Previous research has considered strategy to be 
basically under the control of managers, but has viewed environments as 
constraints that in certain situations managers can proactively change (Hofer 
& Schendel, 1978; Pfeffer & Salancik, 1978). Thus, much of the strategic 
management literature has focused on the relationship between strategy and 
performance and considered environments as moderators of that relationship. 
The field of industrial organizational economics has emphasized the linkage 
between environment and performance and thus has viewed environments 
- as primary determinants of performance (Porter, 1981). Recently, there have 
been attempts to examine the relationships among environment, strategy, 
and performance variables (Hambrick, 1986; Hitt, Ireland, & Stadter, 1982; 
Jauch, Osborn, & Gluck, 1980). However, research examining that threefold 
relationship has not adequately addressed the issue of whether environ- 
ments are (1) independently related to performance, (2) moderators of the 
relationship between strategy and performance, or (3) some combination of 
the two. The purpose of the present study is to examine this issue. 

The research question was this: do environments independently influ- 
ence performance, or do environments modify the strength or the form of the 
relationship between strategy and performance? This question addresses the 
heart of contingency theory (Schoonhoven, 1981). If environments indepen- 
dently influence performance, then contingency relationships do not exist. If 
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environments modify the strength of the relationship between strategy and 
performance, the role of contingency theory would be to identify meaningful 
subenvironments and to determine the relative strength of the effects of 
these subenvironments on relationships between strategy and performance. 
If, on the other hand, environments modify the form of the relationship 
between strategy and performance, the role of contingency theory would be 
to identify key interactions and establish their links to performance. 


THEORETICAL BACKGROUND AND PAST RESEARCH 


Scholars in the field of strategic management have conceptualized envi- 
ronment as one of the key constructs for understanding organizational behav- 
ior and performance (Hofer & Schendel, 1978). Basing their research on an 
extensive review of over 80 articles, Lenz and Engledow (1986) identified 
five approaches to modeling environments: the industry structure model 
(Porter, 1980), the cognitive model (Weick, 1979), the organizational field 
model (Dill, 1958), the ecological and resource dependency model (Aldrich, 
1979; Pfeffer & Salancik, 1978), and the era model (Naisbitt, 1982). All these 
approaches to modeling environments vary in terms of assumptions about 
environmental structures, assumptions about the process and causes of envi- 
ronmental change, and assumptions about how managers or researchers know 
and understand environments (Lenz & Engledow, 1986). 

Regardless of how environments are modeled, research findings suggest 
that their characteristics influence decision making through managerial per- 

ceptions and objective dimensions of industries’ structures. For example, 
Miles and Snow's (1978) research illustrated that managers enact their 
domains by focusing on certain conditions, trends, and events in their 
environments. Snow and Miles (1983) further suggested that “those factors 
that go unnoticed or are deliberately ignored are not part of the organization's 
enacted environment and thus do not affect managerial decision making and 
action" (1983: 239). - 

Lawrence and Lorsch (1967) found that managers in the standardized 
container industry perceived their environment to be more certain than 
managers in the plastics and consumer foods industries. Further, they found 
that the dominant environmental requirements varied among industries; prod- 
uct and technological development dominated in plastics, product develop- 
ment and market research in consumer foods, and customer service and 
product quality in standardized containers. _ 

` The basic model for industrial organizational economics is the struc- 

` ture-conduct-performance paradigm. According to the paradigm, the per- 

formance of a firm depends on its conduct in such matters as pricing policies, 

R & D emphasis, and investment policies. Conduct, in turn, depends on the 

structure of the industry, which includes the barriers to entry, level of 
concentration, and degree of product differentiation (Scherer, 1980). 

Economists employing the market structure-conduct-performance para- 
digm have suggested for years that objective characteristics of industries 
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such as rates of growth and concentration levels affect both the conduct or 
strategy of firms and their performance (Bain, 1959; Scherer, 1980). The 
Profit Impact of Market Strategy! (PIMS) studies found that industry charac- 
teristics háve an effect on the performance of business units, and Rockart 
(1979) suggested that industry characteristics determine certain factors criti- 
cal for the success of business strategies. Porter (1980) developed probably 
the most comprehensive treatment of industry influences on firms' strategies 
and performance levels. Overall, these findings have suggested that both the 
perceived and objective environments of an industry moderate the relation- 
ship between a business' strategy and its performance. That is, different 
environments will have different sets of critical success factors (Rockart, 1979). 
However, there have been very few attempts to examine the moderating 
effects of environments on the relationship between strategy and perfor- 
mance. 


Moderating Effects of Environments 


In general, studies can examine moderating effects in two ways: either 
(1) by including interaction variables in an additive model, or (2) by estimat- 
ing parameters of an additive model for subgroups of a total sample (Arnold, 
1982; Sharma, Durand, & Gur-Arie, 1981). Analysis using interaction terms 
examines the form of a relationship and subgroup analysis examines the 
strength of a relationship. Jauch, Osborn, and Glueck (1980), employing 
moderated regression analysis, examined the interacting effects of nine envi- 
ronmental changes on eight strategic decisions and two performance 
measures. Their results indicated that environmental change has little rela- 
tionship to performance, although strategic decisions concerning financial 
changes and production efficiency were statistically significantly related to 
performance. The interaction of environmental changes and strategic deci- 
sions was not significant. The only other study found that used moderated 
regression analysis to examine relationships between environment, strategy, 
and performance was by Hitt, Ireland, and Stadter (1982). Their study exam- 
ined separately the moderating effects of two variables, grand or corporate 
strategy and type of industry, on the strategic importance of functional areas 
and company performance. Their results indicated that both grand strategy 
and type of industry had separate, significant moderating effects on the 
relationship between functional areas and performance. An attempt to exam- 
ine the three-way interaction of industry type and grand strategy simulta- 
neously on the relationship between functional importance and performance 
was not possible because their samples were small. Thus, the question of 
whether grand strategy and industry type simultaneously have an interacting 
effect on the relationship between functional importance and performance 
was left unanswered. 


1 For a comprehensive review of research on strategy that has used the PIMS data base, see 
Ramanujam and Venkatraman (1984). 
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Prescott (1983) used analysis of variance to examine the effects of the 
interaction of Porter's (1980) sets of generic strategies and generic industries 
on the performance of business units. Porter's generic strategies include 
overall cost leadership, focus, and differentiation (1980: 34—46); his generic 
industries include fragmented, mature, global, declining, and emerging (1980: 
chaps. 9—14). Generic strategies explained 12 percent of variance in 
performance, and generic industries and the interaction of the two sets 
explained 1 and 2 percent respectively. A potential problem with using such 
generic categories is a loss of information through aggregation. 

The second approach to examining the moderating effects of environ- 
ments on the relationship between strategy and performance is subgroup 
analysis. The assumptions underlying subgroup analysis are that an interac- 
tion term of the moderated regression type is not significant and that the 
error term is posited to be a function of the moderator variable (Sharma et al., 
1981). Therefore, if subgroups that are homogeneous in terms of error vari- 
ance are formed from the total sample, the predictive efficacy of the sub- 
groups should increase. The typical tests for the existence of subgroups are 
the chi-square test, the Chow test (Chow, 1960), and their extensions (Neter 
& Wasserman, 1974). Many studies have employed subgroup analysis in the 
context of the relationship between strategy and performance (Bass, 1974; 
Bass, Cattin, & Wittink, 1977; Gale, 1972; Hambrick, 1986; Hatten & Schendel, 
1977; Hatten, Schendel, & Cooper, 1978; Newman, 1978; Oster, 1982; Phillips, ` 
Chang, & Buzzell, 1983; Porter, 1979; Prescott, 1983). Two major issues with 
which these studies have been concerned are the formation of subgroups and 
the testing of subgroups for homogeneity. The studies have generally con- 
cluded that use of subgroups increases the predictive efficacy of regression 
equations and that the strength of strategy variables varies across the 
subgroups. 

In sum, although studies employing moderated regression analysis have 
examined the ways in which environments affect the form of the relation- 
ship between strategy and performance, studies employing subgroup analy- 
sis have typically examined only the strength of this relationship across vari- 
ous subgroups and assumed that there was no statistically significant 
interaction. No study. has had the explicit purpose of examining both the 
form and the strength of the moderating effect of environments on the rela- 
tionship between strategy and performance. 


TYPOLOGY OF SPECIFICATION VARIABLES 


Building on the work of two previous researchers (Lazarsfeld, 1955; 
Rosenberg, 1968), Sharma and his colleagues (1981) developed a typology of 
specification variables. A specification variable is one that specifies the form 
orthe strength, or both, of the relationship between a predictor and a criterion. 
Figure 1 depicts this typology, which has two dimensions. Shown on the 
horizontal axis, the first dimension addresses whether a specification vari- 
able is related to a criterion variable. On the vertical axis, the second dimen- 
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| FIGURE 1 
Typology of Specification Variablesa: b 
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aNote.that in this study the predictor variables were strategy variablas; the criterion variable 
was ROI, and the specification variable was environment. 
bAdapted from Sharma, Durand, and Gur-Arie (1981). 


sion addresses whether the specification variable interacts with the predic- 
tor variable. For the purposes of the present study, the predictor variables 
were strategy variables, the criterion variable was return on investment (ROI), 
and the specification variable was environment. ' 

In Figure 1, the specification variable could be related to the criterion 
variablé or to the predictor variable or to both without interacting with the 
predictor. This specification variable might then be called an intervening, 
exogeneous, antecedent, suppressor, or predictor variable, on the basis of 
other considerations. Cell 1 represents these possibilities. The remaining three 
cells represent different types of moderator variables. Cell 2 represents a 
moderator variable that is not significantly related to the criterion and predic- 
tor variables and does not interact with the predictor. This type of moderator 
variable, referred to as a homologizer, influences the strength of the relation- 
ship between the predictor and criterion variables across homogeneous sub- 
groups (Arnold, 1982; Zedeck, 1971). : 
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Moderator variables in cells 3 and 4 influence the form of the relation- 
ship between the predictor and criterion variables. A moderator variable that 
influences the form of a relationship implies a significant interaction between 
the moderator and predictor variables. Moderator variables in cell 3 are 
significantly related to the criterion or predictor variables or to both and are 
referred to as quasi moderators. Cell 4 moderators are not significantly related 
to either criterion or predictor variables and are termed pure moderators 
(Sharma et al., 1981). 


HYPOTHESIS 


Contingency theory suggests that key strategic requirements vary depend- 
ing upon environmental conditions. But contingency theory provides little 
guidance for predicting the form or strength of relationships between strat- 
egy variables and environments. As was discussed in the preceding pages, 
empirical research using moderated regression analysis and subgroup analy- 
sis has generally concluded that the form of such relationships does not vary 
across subenvironments but that their strength does vary. On the basis of 
these conclusions, the following is posited: 

Hypothesis 1: Environments moderate the strength but 

not the form of relationships between strategy variables 

and performance. 
This hypothesis states that the interaction terms of environment and strategy 
variables will not be statistically significant. However, the strength of the 
relationship between strategy variables and performance will vary across 
environmental subgroups. In terms of Figure 1, the hypothesis states that 
environment is a homologizer. : 


METHODS 
Analysis 


The analysis followed the procedures for identifying moderator vari- 
ables outlined by Arnold (1982) and by Sharma and colleagues (1981). The 
model developed in this study identified environment as the moderator 
variable, a set of strategy variables as the predictor variables, and ROI as the 
criterion variable. An alternative model having strategy variables as modera- 
tors and environment as the predictor was rejected because, as Arnold (1982) 
suggested, ‘from the viewpoint of substantive theory, it is sensible to main- 
tain the convention of labeling the primary causal variable under investiga-. 
tion as the ‘independent variable’ X and the additional variable hypothe- 
sized to moderate the causal impact of X as the ‘moderator variable’ Z” 
(1982: 154). Thus, the strategy variables were chosen as predictors because 
the field of strategic management focuses on managerial choices within envi- 
ronmental contexts. The procedure for identifying moderator variables had 
four steps: 

Step 1: Using moderated regression analysis, determine if a signifi- 
cant interaction éxists between the hypothesized moderator and predic- 
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tor variables. If a significant interaction exists, go to step 2. Otherwise, 
proceed to step 3. , : 

Step 2: Determine if the moderator variable is significantly related 
to the criterion variable. If it is, then the variable is a quasi moderator 
(cell 3, Figure 1). If not, the variable is a pure moderator (cell 4, Figure 
1). The moderator variable in both cases influences the form of the 
relationship between the predictor and criterion. 

Step 3: Determine if the moderator variable is significantly related 
to the predictor variable. If it is, then it is an exogeneous, predictor, 
intervening, antecedent, or suppressor variable (cell 1, Figure 1). If the 
hypothesized moderator is not related to either the predictor or crite- 
rion variables, proceed to step 4. 

Step 4: Develop subgroups based on the hypothesized moderator 
variable. Test for the significance of predictive validity across the 
subgroups. If significance is found, then the variable is a homologizer 
operating through the error term (cell 2, Figure 1). If significant differ- 
ences are not found, then the variable is not a moderator. 


Data 


Data for environment, strategy, and performance variables were identi- 
fied for 1,638 business units in tbe PIMS data base over the years 1978 
through 1981. Several studies have examined the quality and reliability of 
this data base (Anderson & Paine 1978; Hambrick, McMillan, & Day, 1982; 
Phillips & Buzzell, 1982; Phillips et al., 1983). The work of these authors 
suggests that both the quality and reliability of the data base is high.? 


Variables 


The overall measure of environment used in this study was a discrete 
variable with eight categories representing business units’ environments. 
Every business unit was classified into one of eight categories according to 
its pattern on 17 environmental components representing characteristics of 
market structures. The categories were developed through cluster and dis- 
criminant analysis of the 17 environmental components (see the Appendix). 
The eight environmental categories are emerging, fragmented, fragmented 
with auxiliary services, global importing, global exporting, stable, mature, 
and declining. The Appendix describes their key characteristics. The eight 
categories are reported here because the results indicated that subgroup 
analysis was necessary to identify the moderating effect of the environment 
on the relationship between the strategy variables and performance. Further, 
the eight categories proved to be the most concise method of presenting the 
results while not distorting the analysis, results, or interpretation. The data 
were analyzed in three ways according to the procedure for identifying mod- 
erator variables described in the Methods section. First, each individual 
environmental component was analyzed separately. Second, groups of 


? See Ramanujam and Venkatraman (1984). 
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‘environmental components were analyzed in various combinations. Third, 
the eight environmental categories were analyzed together. The results were 
similar for all three analyses. 

Strategy variables were chosen by measuring parts of strategy (Hambrick, 
1980). Seventeen variables were classified into four broad theoretical dimen- 
sions of strategy, defined as follows (Hambrick, 1983): cost efficiency, the 
degree to which costs per unit of output are low; asset parsimony, the degree 
to which assets per unit of output are few; differentiation, the degree to 
which a product and its enhancements are perceived as unique; scale scope, 
the relative size and range of activities of a business within its industry. 

Software limitations prohibited use of all 17 variables. Therefore, on the 
basis of past research using the PIMS data base and the list of strategy 
variables suggested by Hofer (1975) to be important for strategy research, 9 
strategy variables were selected from the set of 17. Three criteria were applied: 
(1) representation of each of the four broad theoretical dimensions of strategy, 
(2) lack of multicollinearity, and (3) maximum R? with ROI with fewest 
possible variables. 

Table 1 presents the set of strategy variables chosen for this study and 
their association with the four broad dimensions of strategy. Only 3 of the 36 
intercorrelations among the strategy variables exceeded .20: capacity utiliza- 
tion and manufacturing expenses/revenues, —.23; employee productivity and 
marketing expenses/revenues, —.25; and relative product quality and relative 

` market share, .31. Thus the set of variables demonstrated a lack of multi- 
collinearity. In terms of maximizing R?, the final combination of nine strate- 
gic variables accounted for 40 percent of the variance in ROI. Adding the 
remaining eight explained an additional 2 percent of the variance. These 
eight variables by themselves explained only 12 percent of the variance in 
ROI. 

Several of the variables are arithmetically related to ROI, as is noted in 
Table 1. This implies that similar mathematical manipulations may partially 


TABLE 1 
Strategy Variables and Their Associations 
with Theoretical Dimensions of Strategy 





Strategy Variables Associated Dimensions of Strategy 

Investment/Intensity> Asset parsimony 

Capacity utilization Asset parsimony 

Employee productivity> Cost efficiency 

Relative direct cost : Cost efficiency 

Manufacturing expenses/gross revenues? Cost efficiency 

Total R&D expenses/gross revenuesb Differentiation, cost efficiency, or both 
Total marketing expenses/gross revenuesb Differentiation, cost efficiency, or both 
Reletive product quality Differentiation 

. Relative market share Scale/scope 


aThe four broad theoretical dimensions come from Hambrick (1983). 
bStrategic variables with an arithmetic relationship to ROI. 
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account for the observed relationships between those variables and ROI 

(Anderson & Paine, 1978; MacMillan, Hambrick, & Day, 1982). This could 

potentially limit the observation of differences in the forms of the relation- 

ships between strategy variables and performance across categories of. 
environments. In the absence of longitudinal data that would allow examina- 

tion of both lagged and mathematical effects, an alternative methodological 

check was made. The eight remaining strategy variables from the original set. 
were analyzed with the same procedure as our nine variables. These remain- 

ing variables were receivables/revenues, total inventory/revenues, degree of 

backward integration, degree of forward integration, relative compensation, 

percentage of purchases from the three largest suppliers, relative product 

line breadth, and relative price. The results of this analysis were consistent 

with those reported in the Results section; the Discussion section includes 

highlights of this analysis. Thus, although the strategy variables that were 

arithmetically linked to ROI may have produced R?s that were biased upward, 

they do not confound the interpretation of the moderating effect of environ- 

ment on the relationships between strategy variables and the performance 

variable, ROI. 


RESULTS 


This reporting of results follows the four-step procedure for identifying 
moderator variables outlined in the Methods section. Table 2 presents the 
results of a moderated regression analysis for strategy variables, environmental 
categories, and their interaction. The adjusted R?s are reported. The strategy 
variables accounted for 40 percent of the variance in ROI; adding the environ- 
mental categories and interaction terms accounted for 2 and O percent 
respectively. Because the interaction term was not significant, the next step 

. Was to determine if environmental categories were significantly related to 
the strategy variables (predictors), or to ROI (criterion), or to both. 

Both correlation analysis and regression analysis were used to deter- 

. mine if environmental categories were significantly related to ROI. The prod- 
uct moment correlations for each of the 17 environmental components with 
ROI revealed that 15 of the 17 had correlations of a magnitude less than or 
equal to .07 (p > .10). Percentage of employees unionized had a correlation of 


TABLE 2 
Results of Moderated Regression Analysis for Strategy Variables, 
Environment, and Their Interaction 


Independent Variables l Cumulative ? F df AR? F df 
e aaae kaletan en a EE 


Strategy 408 124.0* 9,1628 A0 124.00* 9,1628 
Strategy and environment 42 74.1* 16,1621 .02 5.00* 7,1621 
Strategy-environment Interaction 42 16.2* 79,1558 .00 0.00 63,1558 
dch 


sValues reported are for adjusted R?. 
*p < .001 
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—.11 with ROI, and frequency of product changes had a correlation of .10. 
Neither of these correlations is high enough to suggest a significant relation- 
ship with ROI (p > .10). Regressing the eight environmental categories and 
the 17 environmental components separately against ROI produced adjusted 
Bis of oi (F7,1630 = 2.85, p < .006) and .05 (F17,1620 = 5.63, p < .001), 
respectively. We obtained separate regression equations for both the eight 
environmental categories and the individual environmental components to 
check the consistency of the results, because using aggregated categories in 
regression analysis often leads to a significant loss of information. The results 
of the two sets of regressions were consistent, indicating that the loss of 
information was not serious. These results, coupled with those of the moder- 
ated regressions, suggest that although the relationship between environ- 
ment and ROI is statistically significant owing to large sample sizes, its 
practical significance is questionable. 

The intercorrelations of each environmental component and strategy 

variable were examined to determine if they were significantly related. Of 
the 153 intercorrelations (17 environmental components X 9 strategy 
variables), four were significant (p « .05). These four significant correlations 
were industry value added and employee productivity, .42; life cycle stage 
.and R & D expenses/revenues, —.26; industry exports and R & D expenses/ 
revenues, .29; and importance of auxiliary services to end users and R & D 
expenses/revenues, .29. The results suggested that, overall, the environment 
and strategy variables were not significantly related at the p « .05 level. 

The final step in the analysis was the identification of subgroups on the 
basis of the presumed moderator variable. We used the eight subenviron- 
ments referred to in the Methods section and briefly described in the Appen- 
dix for the subgroup analysis. Table 3 shows the strategy variables that had 
betas significantly (p < .05) related to ROI in each subenvironment. This table 
illustrates two important points. First, the sign of the betas for each strategy 
variable does not change across the subenvironments, which indicates a lack 
of interaction and is consistent with the results of the moderated regression 
analysis. A chi-square test (Arnold, 1982) for the strength of the correlations : 
between each strategy variable individually and ROI across the eight 
subenvironments was only significant (p « .05) for the correlation between 
R & D expenses and revenues. Thus, examining the strength of the relationship 
between strategy variables and ROI individually across subgroups is not 
likely to demonstrate statistically significant differences. 

The second important point that Table 3 illustrates is that the pattern of 
significant strategy variables varies across the eight subenvironments. A chi- 
square test (Arnold, 1982) for the difference in the strength of the adjusted 
R?s for the set of nine strategy variables across the eight subenvironments 
was not significant. Therefore, the predictive validity of the set does not vary 
across the subenvironments. However, a Chow test (Chow, 1960) of the 
equality of both the slopes and the intercepts of the regression lines across 
the subenvironments was significant (F — 2.08, df — 63, 1558, p « .01). A 
further examination of the regression lines showed differences in their slopes 
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across the subenvironments that were at best marginally significantly differ- 
ent (F = 1.77, df = 63, 1558, p < .06). Therefore, the main difference between 
the regression lines was in their intercepts and not in their slopes. These 
results suggest that the set of strategy variables that are significantly related 
to ROI varies by subenvironment. This is the primary moderating effect of 
environments on the relationship between strategy and performance. 

The results reported here support the hypothesis that environments mod- 
ify the strength but not the form of the relationship between strategy vari- 
ables and performance. Specifically, environment acts as a homologizer that 
modifies the strength of the effects of subsets of strategy variables within 
particular subenvironments. The subset of strategy variables that were signifi- 
cantly related to ROI varied across the subenvironments; however, the pre- 
dictive validity of the complete set did not show statistically significant 
differences. 


DISCUSSION 


Researchers in the field of strategic management have viewed environ- 
ment as a key contingency variable for strategy formulation. Thus, the find- 
ing that environment, as measured by characteristics of market structures, 
moderates the strength but not the form of the relationship between strategy 
and performance has important implications for future theory and research, . 
public policy, and managers. One implication of this study for contingency 
theorists is a need to identify environmental subgroups and to examine re- 
lationships between strategy and performance within and across these sub- 
‘groups. The development and identification of subgroups has become an 
increasingly important topic in strategy research (Hambrick, 1983, 1984; 
Miller & Friesen, 1984). This study developed subgroups using multiple 
indicators of market structures and multivariate classification techniques. 
Future research should explore other methods of identifying environmental 
dimensions, such as using perceived characteristics (Miller & Friesen) and 
other grouping techniques. These methods and techniques should then be 
examined within the framework developed by Arnold (1982) and Sharma 
and colleagues (1981) for their moderating influences. For example, there is 
some evidence that strategy process variables like managerial decision mak- 
ing styles moderate the forms of relationships among market share, strategy, 
and performance (Gupta & Govindarajan, 1984). An extension of their study 
would be to explore if and how environment influences these relationships. 

This study also suggests that when examining parts of business-level 
strategy (Hambrick, 1980), researchers should examine multiple strategy 
variables rather than one or a few. The important finding that relationships 
among environment, strategy, and perfórmance vary primarily when analy- 
‘sis includes multiple strategy variables supports this conclusion. Further, 
researchers should use multiple performance measures when possible. 

The relative significance of the strategy variables within the subenviron- 
ments provides insights as to the roles of the theoretical dimensions of 
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strategy across the subenvironments (Hambrick, 1983; see Table 1). Findings 
were similar to those of MacMillan and colleagues (1982) in that asset parsi- 
mony and reduced costs helped performance across all of the subenviron- 
ments. However, in the mature, declining, and global exporting subenviron- 
ments these dimensions were especially critical. The mature subenvironment 
‘further differed from the others in that relative market share, a variable 
associated with scope, was significant and positive while backward vertical 
integration, also associated with scope, was negative and significantly related 
to ROI. The mature subenvironment was also the only one in which relative 
price, associated with differentiation, was positive and significantly related 
to ROI. Thus, the mature subenvironment had significant strategy variables 
in each of the four theoretical dimensions of strategy. The stable, non- 
fragmented environment was similar to the mature one except that the rela- 
tionships were somewhat weaker. 

The fragmented subenvironment provided a situation that sharply con- 
trasted with the mature subenvironment. As expected, the scale/scope and 
differentiation dimensions played small roles in the fragmented subenviron- 
ment. It appears that fragmented subenvironments lend themselves to fewer 
strategic options than the other subenvironments. 

In the global importing subenvironment, the scale/scope dimension was 
found to be significant. Not only was relative market share important, but 
both forward and backward integration increased return on investment. This 
can be contrasted with results for the emerging subenvironment, in which 
scope was not a significant dimension, and with results for the declining 
subenvironment, in which scope was an important dimension but the differ- 
entiation dimension seemed to play a small role. These few observations 
suggest that the subenvironments can to a certain degree be contrasted and 
compared using the theoretical dimensions identified by Hambrick (1983). 
Future research should more closely examine the theoretical dimensions of 
strategy and how different environments influence the significance of each 
dimension for competitive strategy and performance. 

Industrial organizational economics is extremely concerned with com- 
petition’s implications for social welfare (Scherer, 1980); strategic manage- 
ment is more concerned with the performance of firms (Porter, 1981). The 
main implication of this study for public policy is that an environment’s 
market structure should not be viewed primarily as an independent variable 
directly influencing the performance of firms. Rather, environment moder- 
ates the strength of relationships between business units’ strategies and their 
performance. For example, market share, often used as a measure of market 
power and performance and thus as a rationale for antitrust action, may not 
be an appropriate indicator of either power or performance in certain contexts. 
This indeed was the case in this study, in which market share was signifi- 
cantly. related to performance in only four of the eight environments. Two 
previous studies (Phillips et al., 1983; Prescott, Kohli, & Venkatraman, 1984) 
found similar results. Thus, public policy decisions need to include strategy 
variables for individual firms and to assess the relative effects of those vari- 


342 Academy of Management Journal June 


ables in the envirenments under consideration. The finding that market 
structure characteristics are mildly related to ROI is consistent with results 
from much research in industrial organizational economics (Bothwell, Cooley, 
& Hall, 1984; Scherer, 1980). Scherer also suggested that economists include 
firm variables in their studies. 

Finally, managers who develop strategies to either adapt to changing 
environmental conditions or to proactively influence their environments 
should find the results of this study encouraging. The finding that environ- 
ment moderates the strength but not the form of relationships between strat- 
egy and performance certainly makes their task easier. Specifically, environ- 
ment moderates the set of strategy variables significantly related to ROI. 
Managers should focus on identifying the strategy variables significantly 
related to performance in their environments and adjust their strategies 
accordingly. The technique of critical success factors popularized by Rockart 
(1979) could assist in the process. If environment does not change the form 
of relationships between strategy and performance, perhaps changing envi- 
ronmental situations require changes in the relative emphases of strategies 
rather than changes in strategies. 


CONCLUSIONS 


The relationship between the environment, strategy, and performance of 
a business is a central concern of strategic management. This study used 
moderated regression and subgroup analysis to explore the moderating influ- 
ence of environment on the relationship between strategy and performance. 
In the results of the moderated regression analysis, a set of strategy variables 
accounted for 40 percent of the variance in the relationship; environment 
accounted for 2 percent; and the interaction term was not significant. The 
_ results of the subgroup analysis revealed that environment served as a homolo- 
gizer for a set of strategy variables in that it modified the strength but not the 
form of the relationship between strategy and performance. Thus, environ- 
ment moderates this relationship because the set of strategy variables that 
are significantly related to performance varies across subenvironments. These 
results do not suggest that environment is not important to high performance, 
_ but rather that contingency theory and research should focus on identifying 
meaningful subenvironments and on examining strategy-performance re- 
lationships with and across these subenvironments. Environment is critical 
because it establishes the context in which to evaluate the importance of 
various relationships between strategy and performance. 
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APPENDIX 


The empirical development of eight environmental categories or subgroups based on 17 
characteristics of market structures employed cluster analysis; results were interpreted in the 
context of Porter's (1980) theoretical set of generic industries. The first five steps identified eight 
relatively independent and homogeneous subgroups. The final two steps involved interpreting 
the subgroups. 

Step 1 was selection of 17 environmental components based on theory and previous research 
(Scherer, 1980) and lack of multicollinearity. 

Step 2 was random selection of 311 business units. 

Step 3 was cluster analysis of the 311 business units (Ward's method; see Hambrick, 1983, 
and Wishart, 1978). 

Step 4 was chosing the number of subgroups. The criteria were: (1) examination of sharp 
changes in error sum of squares when the number of clusters is changed, and (2) visual inspec- 
tion of the dendogram (Everitt, 1974). Eight groups were identified. 

Step 5 was cross validation. Discriminant functions were developed for each of the eight 
groups identified in step 4. The discriminant functions were then used to classify 1,792 busi- 
ness units into the eight subenvironments. For a business unit to remain in the sample, it had to 
have a probability of group membership in any one group of .5 or greater. This criterion was 
chosen to ensure homogeneous groups. Of the 1,792 business units, 1,638 (91%) met the criterion. 
The average probability of group membership for the 1,638 business units was 88 percent. 

Step 6 was developing profiles for each subgroup based on both natural and standardized 
mean scores of the 17 environmental components. Descriptive statistical data on these compo- 
nents are available from the author. 

Step 7 was interpreting the subgroups in terms of Porter's (1980) typology of generic 
industries. 

The following describes key characteristics for the eight muon Descriptive statistical 
data are available from the author. 

1. Global exporting 

ROI = 23.88%. Key characteristics include: greater than 30 percent of sales represent exports; 

growth stage of product life cycle; 12 percent long-term industry growth; 10 percent real 

market growth; and auxiliary services important to end users. 
2. Fragmented (standard products) 

ROI = 14.51%. Key characteristics include: fragmented set of customers; annual product 

changes; purchases by end users annually; and low industry concentration. 
3. Stable, nonfragmented 

ROI = 23.01%. Key characteristics include: high market share stability among competitors; 

very little customer fragmentation as evidenced by 10 percent of customers accounting for 

50 percent of sales; purchase frequency is once per month; and new product development 

less than two years. 

4. Fragmented with auxiliary services 

ROI - 22.2896. Key characteristics include: low industry concentration; fragmented 

customers; purchases by end users annually; and high importance of auxiliary services. 
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5. Emerging : 
ROI = 20.31%. Key characteristics include: high market. share instability; early in product 
life cycle; 13 percent long-term industry growth; 19 percent real market growth; low percent- 
age of employees unionized; and no regular pattern of product changes. 

6. Mature 
ROI = 23.32%, Key characteristics include: industry concentration around 70 percent; mature 
stage of product life cycle; high growth rates for both materials and wages; and high percent- 
age of unionized workers. 

7. Glabal importing 
ROI = 28.69%. Key characteristics include: greater than 30 percent of sales represent imports; 
high concentration is 74 percent; and long term growth is 12 percent. 

8. Declining 
ROI = 20.39%. Key characteristics include: decline stage of product life cycle; real market 
growth is less than 1 percent; high value added/employee; high purchase frequency; long 
development time for new products; and auxiliary services unimportant. 


John E. Prescott earned his Ph. D. degree at the Pennsylvania State University; he is an 
assistant professor of business administration at the Graduate School of Business, Uni- 
versity of Pittsburgh. His current research interests include the interfacing of industrial 
organizational economics and strategy management, and industry and competitor 
analysis. 
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Drawing on the relevant literature, this study develops seven key di- 
mensions of planning systems, five reflecting their design aspects and 
two tapping the organizational context of planning. Discriminant anal- 
ysis is employed to evaluate the ability of these seven dimensions to 
distinguish between more and less effective planning systems, using 
three different criteria of planning effectiveness. Although influences 
on effectiveness seem to vary depending on the criterion of effective- 
ness used, overall the most important dimensions were system cap- 
ability, or a system's orientation toward creativity and control; re- 
sources provided for planning; and functional coverage. 


The relationship between strategic planning, which we will refer to as 
simply planning in this study, and organizational performance is one of the 
most extensively researched issues in the field of strategic management. An 
illustrative list of studies addressing the effects of planning on performance 
includes Ansoff, Avner, Brandenburg, Portner, and Radosevich (1970), Fulmer 
and Rue (1974), Grinyer and Norburn (1975), Herold (1972), Karger and Malik 
(1975), Kudla (1980), Lindsay and Rue (1980), Malik and Karger (1975), Rue 
and Fulmer (1973), Thune and House (1970), and Wood and LaForge (1979). 
Two studies provide complete reviews and critiques of this literature (Arm- 
strong, 1982; Lorange, 1979). ; 

Despite the large number of research efforts that attempt to elucidate 
links between planning and performance, the results of this body of research 
are fragmented and contradictory. This state of affairs has arisen because 
most previous research on the effectiveness of planning systems has suffered 


Portions of this paper were presented at the Fourth Annual Strategic Management Society 
Conference, Philadelphia, in October 1984. The authors would like to thank Jarmell Boyd, 
Patrick Gaughan, Linda Natal, and Loretta Riles for their administrative assistance during the 
data collection and analysis stages of the project on which this paper is based. The insightful 
comments and suggestions provided by the two anonymous reviewers are also gratefully 
acknowledged. 
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from two major conceptual shortcomings. First, most studies have used rather 
simplistic conceptualizations of the notion of planning. Thus, researchers 
have attempted to show differences in financial performance between 
“planners” and "nonplanners'" or “formal planners” and “informal planners." 
Second, most studies have been solely preoccupied with the linkage between 
planning and the financial aspects of corporate performance, even though 
conceptual writings on formal planning systems stress several nonfinancial, 
intangible benefits (Camillus, 1975; Steiner, 1979). 

With regard to the first issue, crude dichotomizations along single dimen- 
sions like the presence or absence of planning or its degree of formality are 
clearly suspect, because the debate as to what the term planning connotes is 
still a live one (Mintzberg, 1981; Snyder, 1982). Such conceptualizations are 
inconsistent with the multidimensional view of planning systems that is 
becoming more and more visible in the literature (e.g., King & Cleland, 1978; 
Lorange, 1979, 1980; Lorange & Vancil, 1977). Similarly, with regard to the 
second issue, an exclusive emphasis on financial performance is conceptu- 
ally flawed and is open to attack as being narrowly focused. A broader 
concept of effectiveness is needed. 

A truly meaningful assessment of the value of planning systems should, 
therefore, recognize their multidimensional nature and the plurality of 
approaches that can be used to assess their worth. This research study 
attempted such an assessment; it has three distinctive features: First, on the 
basis of an extensive literature review, we identify seven distinct design and 
contextual dimensions of planning systems that many previous researchers 
believe influence their effectiveness. Second, we explicitly recognize the 
many possible meanings of effectiveness in the context of planning and use 
multiple criteria for assessing the effectiveness of planning systems. Third; 
we cast the study in a comparative mode by using the technique of discrimi- 
nant analysis in order to bring the differences between more effective and 
less effective planning systems into sharp relief. 

The purpose of this paper is to test a two fold proposition: (1) relatively 
more effective systems will differ from relatively less effective ones along the 
seven key design and contextual dimensions referred to earlier in this section, 
and (2) the relative importance of these dimensions will vary depending 
upon the criterion of effectiveness used. 


DIMENSIONS OF PLANNING SYSTEMS 


Planning systems are multifaceted management systems that are contex- 
tually embedded. Hence, they cannot be adequately described in terms of 
one or two characteristics such as “formality.” Although employing differ- 
ent terminology, most writings (e.g., King & Cleland, 1978; Lorange & Vancil, 
1977; Steiner, 1979) have emphasized the notion that planning systems are 
best described in multidimensional terms. Unfortunately, there is at yet no 
consensus as to what these dimensions are. 

Nevertheless, in one way or other, most researchers emphasize two sets 
of influences on a planning sysiem’s effectiveness. These are the design 
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elements of the planning system and the organizational context of planning. 
We synthesized seven dimensions that tap these two aspects. Five design 
elements were included in this study. They are (1) system capability, (2) use 
of techniques, (3) degree of attention to internal facets, (4) degree of attention 
to external facets, and (5) functional coverage. The following two dimen- 
sions captured the organizational context of planning: (1) resources pro- 
vided for strategic planning, including top management support, and (2) 
resistance to planning or "anti-planning biases” (Steiner, 1979: 95—108). 
Although we could identify other dimensions that might conceivably influ- 
ence the effectiveness of planning, these seven have the support of an exten- 
sive literature. Also, they are variables that are relatively amenable to control 
by managers. 

Table 1 summarizes the seven dimensions and indicates the support in 
the literature for each of them. The following paragraphs provide a brief 
discussion of each dimension. 


Design Elements 


System capability. Strategic planning systems differ in the extent to 
which they emphasize creativity as opposed to contro] (Camillus, 1975; 
Shank, Niblock, & Sandalls, 1973). It has been stressed that these two some- 
what opposed orientations need to be balanced in order to enhance the 
effectiveness of a system. An excessive emphasis on creativity at the expense 
of control or vice versa is apt to be dysfunctional. We conceptualize system 
capability in terms of the ability of a system to foster control as well as 
creativity. The créative capability of a system is assessed in terms of its 
ability to anticipate surprises and crises, its ability to adapt to unforeseen 
changes, and so forth. On the other hand, the control aspect of a system 
refers to the degree of emphasis given to managerial motivation, upward and 
downward communication within an organizational hierarchy, integration 
of diverse operational areas, and the like. 

Use of techniques. A variety of analytical techniques and methodologies 
have been developed to aid managers in identifying and dealing with strate- 
gic decisions and problems (Grant & King, 1979). Use of these techniques is 
one indication of the extent of formalization of a planning process. The 
extent of reliance on various planning techniques and methodologies also 
indicates what an organization’s approach to decision making is; for instance, 
a comprehensive versus an incremental approach might be indicated (Fred- 
rickson, 1984). - 

The extent of reliance on planning techniques is thus an important dimen- 
sion of planning systems. Since organizations use formal techniques with 
the assumption that these techniques will enhance effective definition and 
resolution of the ill-structured problems of planning, we also expect that 
over time this dimension will influence the effectiveness of a planning sys- 
tem itself. 

Attention to internal facets. A formal approach to planning usually begins 
with an assessment of an organization’s recent history and current situation. 
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TABLE 1 


Dimensions of Planning Systems 





Dimensions 


Design elements 


System capability 


Use of techniques 


Attention to 
internal facets 


Attention to 
external facets 


Functional coverage 


Organizational context 
of planning 


Resources provided 
for planning 


Resistance to 
. planning 


Description 


The ability of a formal planning system 
to balance creativity and control; adap- 
tive flexibility of a system and its cap- 
ability to support strategy formulation 
and implementation. 


The degree of emphasis given to the 
use of planning techniques to structure 
ill-defined, messy, strategic problems. 


The degree of attention to internal 
(organizational) factors, past perfor- 
mance, and analysis of strengths and 
weaknesses. 


The level of emphasis given to mon- 
itoring environmental trends.. 


The extent of coverage given to dif- 
ferent functional areas with a view 
to integrating different functional 
requirements into a general manage- 
ment perspective. 


The degree of organizational support 
in the form of number of planners, 
involvement of top management in 
planning, etc. 


The need to anticipate and overcome 
resistance to planning and to create a 
favorable climate ior effective 
planning. 


Key Supporting Literature 


Ansoff (1975, 1984) 
Anthony & Dearden (1975) 
Camillus (1975) 

Lorange & Vancil (1977) 
King & Cleland (1978) 
Thompson (1967) 


Grant & King (1979, 1982) 
Hofer & Schendel (1978) 
Hax & Majluf (1984) 


Camillus & Venkatraman 
Grant & King (1982) 
King & Cleland (1978) 
Lorange & Vancil (1977) 
Stevenson (1976) 


Aguilar (1965) 

Fahey & King (1977) 

Keegan (1974) 

Kefalas & Schoderbek (1973) 
Thomas (1980) 


Hitt, Irland, & Palia (1982) 
Hitt, Ireland, & Stadter ( 1982) 
Lorange (1980) 

Snow & Hrebiniak (1980) 


King & Cleland (1978) 
Steiner (1979) 


Steiner (1979) 

Steiner & Schollhammer 
(1975) 

Schultz & Slevin (1976) 





Previous studies have variously referred to this step or stage in planning as a 
“situation audit," an "appraisal," and so forth (Ansoff, 1965; Steiner, 1979). 
In this assessment, organizations' internal capabilities receive close study to 
identify their strengths and weaknesses. Often plans fail because of an inade- 
quate or incorrect assessment of an organization's internal aspects (King & 
Cleland, 1978). This study treats degree of attention to internal aspects such 
as past performance, current strengths and weaknesses, and diagnostic assess- 
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ment of recent failures or performance shortfalls as a distinct and important 
influence on the effectiveness of an organization’s planning. 

Attention to external facets. One of the purposes of planning is to adapt 
- organizations to environmental demands and pressures. Analyzing external 
opportunities and threats is a major element of planning (Andrews, 1971). 
There is no need to belabor the importance the literature grants external 
orientation as an influence on the effectiveness of planning. This study 
therefore includes the extent of attention devoted to various aspects of envi- 
ronments as a further aspect of the design of planning systems. 

Functional coverage. This term refers to the degree of emphasis accorded 
to various functional areas in planning. Functional coverage can vary because 
of strategic differences in the competitive postures of firms in an industry. 
Some firms may attempt to compete on the basis of price and volume, and 
others may emphasize product differentiation and customer service. The 
distinctive competencies demanded by these alternate approaches are usually 
manifested as patterns of emphasis on different functional areas (Hitt, Ireland, 
& Palia, 1982; Snow & Hrebiniak, 1980). In the first case, manufacturing 
efficiencies may be key, but in the second case the marketing function may 
receive special emphasis. A balanced emphasis on all functions may be more 
important in yet other situations in which general management is the key 
success factor. Therefore, we included the degree of emphasis given to vari- 
ous functional areas as a dimension of the design of planning systems. 


Organizational Context of Planning 


Resources provided for planning. Many authors have emphasized that 
planning in an organization cannot be successful unless adequate resources 
are committed to that activity (King & Cleland, 1978; Steiner, 1979). These 
resources may be physical—-a separate planning staff and office; or they may 
be intangible—the time spent by the CEO and other key managers in the 
planning function. Planning is not a costless activity. If organizations expect 
benefits from planning, they must also be willing to incur commensurate 
levels of tangible and intangible costs of doing effective planning. 

Resistance to planning. Early research on planning systems emphasized 
the importance of identifying and overcoming sources of resistance to 
organizational planning (Steiner, 1979; Steiner & Schollhammer, 1975). 
Organizational members may show resistance to a system in many ways; 
some examples are lack of acceptance of the outputs of planning, withdrawal 
or nominal participation in planning activities without active involvement 
on the part of operating managers, and gaming behaviors. Resistance to the 
idea and processes of planning can be expected to exert a negative influence 
on the effectiveness of planning systems. 


MULTIPLE CRITERIA OF EFFECTIVENESS 


This study approached the effectiveness of planning systems from three 
perspectives. First, we examined the extent of fulfillment of key planning 
objectives. Second, in keeping with most prior research, we included an 
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organization’s economic performance as a further criterion of effectiveness. 
Third, we applied an overall measure of satisfaction with a planning system. 


Fulfillment of Planning Objectives 


Six commonly emphasized objectives of planning were considered. 
These were (1) predicting future trends, (2) evaluating alternatives, (3) 
enhancing management development, (4) avoiding problem areas, (5) improv- 
ing short-term performance, and (6) improving long-term performance. The 
following subsections discuss and justify each of these objectives. 

Predicting future trends. Planning is often justified on the grounds that 
contemporary organizational environments have become increasingly tur- 
bulent, thereby necessitating some formal mechanisms for environmental 
monitoring and coping with environmental change (Ansoff, 1984; Gluck, 
Kaufman, & Walleck, 1980, 1982). An objective that follows from this reason- 
ing is improvement of the ability of a planning system to predict future 
trends. Proponents of such capabilities do not suggest that planning should 
necessarily result in accurate predictions of the future, but that planning 
should at least help organizations to delineate probable, plausible, and pref- 
erable future states of the world (Amara, 1981). In fact, according to one 
investigation (Paul, Donavan, & Taylor, 1978), one of the major problems 
with planning is the inability of planners to produce reasonably valid fore- 
casts of the future. Thus, predicting future trends is considered an impor- 
tant task of planning and constitutes our first objective. 

Evaluating alternatives. Critics of planning always point out the ten- 
dency of planning to degenerate into a numbers game. À good planning sys- 
tem, it has been proposed, should not only serve as a vehicle for mind- 
stretching (Camillus, 1975), but also should delicately balance control and 
creativity (Shank, Niblock, & Sandalls, 1973). The extent to which a planning 
system fosters the creative generation and exploration of alternative courses 
of action thus becomes a further important test of its effectiveness, and it is 
second in our list of objectives. 

Avoiding problem areas. Yet another approach to the evaluation of 
planning systems is to examine the extent to which they result in an ac- 
cumulation of experience and enhance organizational learning (Shrivastava 
& Grant, 1985). An evolutionary approach to systems design (Lorange & 
Vancil, 1977) is consistent with the idea that effective planning systems are 
adaptive learning systems. Learning can be said to be occurring if a system 
increases the probability of achieving goals and minimizes recurrence of 
errors. A corollary is that an effective planning system should enable an 
organization to avoid problem areas. This is the third objective for planning. 

Enhancing management development. Formalization of planning should 
lead to an improvement in the quality of management and facilitate manage- 
ment succession. This constitutes the long-term educational value of the 
planning process (Hax & Majluf, 1984; Lorange & Vancil, 1977). Planning’s 
contribution to the development of the depth of management in an organiza- 
tion is by no means an easy achievement to evaluate. Yet, some attention to 
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this important objective appears to be warranted in evaluating planning 
systems. This is the fourth objective. 

Improving short-term and long-term performance. Finally, the impor- 
tance of performance, both short-term and long-term, cannot be over- 
emphasized. In a sense, improving performance is the raison d'etre for adopt- 
ing elaborate planning systems in the first place. This is implicit in the 
exclusive focus on performance that characterizes the impressive body of 
studies on evaluation of planning systems identified earlier. These are the 
fifth and the sixth objectives. 


Performance Relative to Competition 


In addition to the achievement of key planning objectives, we posited 
that effective systems would improve organizational performance. There is a 
subtle difference between an organization’s achieving performance objec- 
tives and performing at a relatively higher level. Hence, we included relative 
performance as a separate criterion of planning effectiveness. This study 
used four performance indicators, each measured relative to competitors 
rather than in absolute terms. These indicators are growth in sales, growth in 
earnings, changes in market share, and return on investment. 


Satisfaction with Planning Systems 


Adopting the approach common in the literature concerning the imple- 
mentation of management information systems (Lucas, 1978), we treated 
satisfaction with planning systems as an additional criterion of effectiveness. 
Satisfaction is particularly important when use of a system is mandatory 
rather than voluntary, which is likely to be the case for an organization’s 
planning system. Satisfaction with a system is likely to reflect fulfillment of 
objectives; thus, satisfaction can also be regarded as an additiorial internal 
validation criterion for the variables measuring fulfillment of objectives. 


D 


RESEARCH METHODS 
Data Collection 


A detailed questionnaire on their companies’ planning practices was 
mailed to 600 executives of Fortune 500, Fortune 500 Service, and Inc 500 
companies. We received completed questionnaires from 207 executives rep- 
resenting both planning and other operating functions, for a response rate of 
34.5 percent. This level of response from a list of predominantly Fortune 
1000 firms is higher than is usual for mail surveys of this population (Gaedeke 
& Tootelian, 1976). Table 2 shows characteristics of respondents and their 
companies. 

The data show bias in favor of larger firms, with 60.9 percent of respond- 
ing firms having annual sales in excess of $1 billion. Similarly, planning 
executives (68.5%) and respondents at the corporate levels of their organiza- 
tions (81.5%) predominated. 
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, TABLE 2 
Characteristics of Respondents and Their Companies? 





Responses Used 
All Respondents in This Study? : 
Characteristics (n = 207) . {n = 93) 


Sales of responding units 


Less than $250 million 17.4 244 
$251 million - $1 billion 21.8 21.1 
Over $1 billion . 60.9 54.4 
Primary business categories of responding units 
Consumer goods f 19.2 13.0 
Capital goods 18.7 20.7 
Raw or semifinished materials , 12.8 10.9 
Components for finished goods 8.9 12.0 


Service sector 40.4 43.5 


Organizational level of responding units 


Corporate 81.5 80.4 

Business unit 18.5 19.6 
Title/job responsibilities of responding executives 

Planning 68.5 67.0 

Other line function 31.5 33.0 
Maturity of planning systems (Year planning began) 

Before 1960 1 6.2 5.9 

1961-1975 36.7 37.6 

After 1976 55.2 56.5 


a All figures are percentages. Nonrespondents have been excluded. 

b A subset of respondents was used for analysis in this study because of missing data. On 
the basis of chi-square tests, the subset used is not statistically different from all respondents 
with respect to any of the five company and respondent characteristics summarized in this 
table, except sales categories. 


Since smaller firms do not usually undertake planning to any significant 
extent (Robinson & Pearce, 1984), this bias toward largeness is not likely to 
be a serious detriment to drawing tentative generalizations about the plan- 
ning processes of firms that do plan. A comparison of the responses of 
operating executives and planning executives using t-tests indicated that 
planning executives as a group did not tend to systematically overrate or 
underrate their planning systems. However, since a single respondent from 
each organization provided data, there is a real possibility of response bias or 
selection bias. A trade-off between number of respondents overall and num- 
ber of responses per firm became inevitable in view. of limitations of our 
resources, and we leaned in favor of a large overall number of respondents. 
Because our target respondents were top-level planning or operating exe- 
cutives, we assumed that they would be quite knowledgeable about their 
planning systems and generally unbiased in their responses. 
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Measurements 


The seven systems dimensions were measured by multi-item scales. The 
lack of valid and reliable measurement scales has been one of the impedi- 
ments to the development of middle-range theories of strategy and strategic 
planning (Venkatraman & Grant, 1986). This study sought to assure content 
validity of each dimension by use of multiple experts, including the authors, 
and an iterative procedure for ensuring exhaustive coverage of each construct’s 
domain. Our use of multi-item scales was motivated by the aim of enhancing 
the reliability of measurements (Nunnally, 1978). 

Effectiveness was measured using both single-item and multi-item scales. 
The six individual variables measuring fulfillment of objectives and their 
simple additive composite made up the objective fulfillment battery. The 
four relative performance items and their composite constituted the relative 
performance battery. Satisfaction was measured by a single-item scale. All 
variables were measured on appropriately anchored 5-point Likert scales. 

The Appendix describes the multi-item scales used in this study. Table 
3 reports means, standard deviations, reliabilities, and intercorrelations for 
the planning system dimensions. Table 4 shows corresponding statistics for . 
the variables measuring effectiveness and composites. 

It should be noted that the dimensions of planning systems used in this 
study were specified a priori because of extensive support from the literature 
(see Table 1). Nevertheless, we undertook factor analysis of the variable set 
to determine if any further parsimony could be accomplished in the number 
of dimensions used. We prepared analyses ranging from a three- to a seven- 
factor solution and carefully studied the varimax-rotated factor scores. Since 


TABLE 3 
Means, Standard Deviations, Reliabilities, and Intercorrelations 
of the Seven Dimensions of Planning Systems? 





Dimensions Means s.d. 1 2 3 4 5 6 7 

1. System capability 3.65 0.62 (93) 
2. Use of techniques 3.18 045 60 (71) 
3. Attention to internal 

facets 3.86 0.61 52 45 (68) 
4, Attention to external 

facets 3.62 0.50 45 22 19 (51) 
5. Functional coverage 3.57 0.36 38 28 34 47 (36) 
6. Resources provided 

for planning š 3.61 0.74 64 49 36 36 40 (76) 
7. Resistance to r 

planning 1.66 0.61 -69 -43 -28 -41 .-—33 —70 (70) 


8A]! values are based on data from 93 firms used in the discriminant analyses. Decimal 
points are omitted for reliabilities and correlations. Reliabilities (Cronbach alphas) are 
shown within parentheses. All correlations are significant at p <.05. 
> The reliability for this dimension for all respondents (n = 207) was 0.77. Although there 
were differences for the other dimensions as well, they were less pronounced. 
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none of the factor structures were easy to interpret, we decided to rely on the 
seven originally specified dimensions. 


Choice of Analytic Technique 


Given our interest in exploring the dimensions of systems contributing 
to differences in effectiveness between more and less effective systems, we 
deemed discriminant analysis to be the appropriate analytical approach. 
Although use of another statistical technique, such as multiple regression, 
may seem more appropriate because the effectiveness variables are interval 
measures, an examination of the correlation matrix for the set of indepen- 
dent variables revealed significant multicollinearity in the data. This further 
reinforced our original decision to use discriminant analysis as the analyti- 
cal method, because multicollinearity does not affect interpretation of the 
results of discriminant analysis, despite what many researchers believe 
(Eisenbeis, 1977).! 

Groupings for discriminant analysis. The effectiveness groupings for 
discriminant analysis were constructed as follows. Since responses were 
made on 5-point scales, all respondents choosing 1, 2, or 3 were placed in 
one group, and those checking 4 or 5 were placed in the other group.? In the case 
of the two composites, we used the median as the cut point to yield the two 
groups for discriminant analysis. 

Tests of multivariate normality. The accuracy of the classifications that 
result from applying discriminant analysis are sensitive to the assumption 
that the predictor variables constitute a multivariate normal population. To 
test for this, we employed the z-tests for the significance of the coefficients of 
skewness and kurtosis discussed by Bock (1975). The discriminating vari- 
ables used in this study displayed significant deviation from normality,? 
suggesting caution in interpreting the accuracy of the classifications reported. 
However, our primary concern was with descriptive rather than predictive 
relevance (Frank, Massy, & Morrison, 1965). We therefore do not believe that 
the failure of a key model assumption, which would be critical for prescrip- 
tive studies, constitutes a serious challenge. 

Analysis. A total of 13 discriminant analysis runs were performed. Group- 
ings in seven runs were based on the variables measuring fulfillment of 
objectives and their composite; in five runs, the performance variables and 
their composite; and in one run, the measure of overall satisfaction. As seven 


' "Thirteen regressions were nevertheless obtained. They yielded statistically significant 
models with multiple R values in the range 0.42 to 0.75. However, the signs of the coefficients of 
the independent variables were uninterpretable because of multicollinearity in the data, which 
is likely due to common method variance arising from our method of data collection. 

?Before assigning the neutral responses (scored 3) to the low effectiveness groups, we ran 
ANOVAs for each grouping variable for all seven discriminating dimensions. Scheffé pairwise 
comparisons across the three groupings showed that the neutral groups were more similar to the 
low effectiveness groups than to the high effectiveness groups, thus justifying the pooling of the 
neutral ratings with the low effectiveness ones. 

?This problem persisted even when various data transformations were tried. The detailed 
results of this analysis are not reported here in the interest of conserving space. 
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discriminators were composite dimensions, missing data on even one of the 
component variables for any respondent led to the loss of that case from our 
analysis; 93 cases had complete data for constructing all the seven planning 
system dimensions. A comparison of these 93 cases with the 114 incomplete 
cases with respect to various characteristics confirmed that this data reduc- 
tion did not lead to any significant biasing in the subset used for discrimi- 
nant analysis (Table 2). 


RESULTS AND DISCUSSION 
Overall Results 


Table 5 shows the results for the discriminant analysis using satisfaction 
as the effectiveness criterion and serves as a prototype for Tables 6 and 7, 
which summarize the results for the variables measuring fulfillment of 
objectives and relative performance, respectively. 

Of the 13 discriminant functions obtained, 12 were significant at p < 
.05. The run for growth in earnings was significant at p < .10. It is encourag- 
ing that the same seven dimensions lead to significant discriminant func- 
tions for each of the effectiveness criteria used. To a degree, this is not 
surprising. The variables measuring fulfillment of objectives were all highly 
intercorrelated, as were those measuring relative performance (Table 4). In 
addition, the objective fulfillment variables had high correlations with the 








TABLE 5 
Results of Discriminant Analysis for Groupings Based on Satisfaction 
Criteria Results 
Group sizes 
Number of dissatisfied planners (group 1) 42 
Number of satisfied planners (group 2) 51 
Significance level of the linear discriminant function .01 
Assumption of equality of group dispersion matrices Not valid 
(p for Box's M) (.01) 
Percent classified accurately by linear classification rule? 
Group 1 78.6 
Group 2 84.3 
Overall 81.7 
Percent accuracy of chance model based on sample group prior 
probabilities 50.5 
Standardized discriminant function coefficients 
System capability —0.33 
Use of techniques —0.07 
Attention to internal facets 0.05 
Attention to external facets 0.50 
Functional coverage —0.17 
Resources provided for planning —0.23 
Resistance to planning 0.71 


* The failure of Box's M test for equality of group dispersion matrices suggests that quadratic 
classification procedures might be more appropriate. The use of a quadratic clarification 
rule, however, did not lead to a noticeable improvement in classification accuracies. 
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satisfaction variable, as expected. The generally low and nonsignificant cor- 
relations between the objective fulfillment and relative performance variables 
does confirm our initial treatment of the two as distinct criteria of effective- 
ness. Thus, the uniformly good discrimination achieved across all the runs 
attests to the value of our seven dimensions of planning systems in classify- 
ing planning systems along either type of effectiveness criterion. 

Equality of group sizes is an important consideration in interpreting bias 
in discriminant analysis (Morrison, 1969). The grouping procedure used in 
this study yielded nearly equal groups, with the smaller groups accounting for 
no less than 45 percent for all but the runs based on alternative evaluation, 
long-term performance, growth in sales, and growth in earnings. For these 
runs, the accuracies of chance models used as benchmarks are accordingly 
higher. In other words, we compared most of our discriminant functions’ 
accuracies in classification against relatively sound test values in judging 
their worth. As can be noted in Table 6, the most unequal group sizes resulted 
for evaluating alternatives. Even in this case, the discriminant function's 
overall classification accuracy of 66.7 percent is somewhat better than the 

' 63.9 percent expected accuracy of the corresponding chance model. For 
growth in earnings, the discriminant function's overall classification accu- 
racy was about 9 percentage points better than that of the corresponding 
chance model. In all other cases the overall classification accuracy of the 
discriminant functions was at least 15 percentage points higher than that of 
the corresponding chance models. Thus, the results of the discriminant anal- 
ysis runs are informative. 

Traditionally, scholars have believed that the absolute magnitudes of the 
standardized discriminant function coefficients of discriminating variables 
reflect their relative importance. These magnitudes have been considered to 
be analogous to the beta weights in a regression model (Klecka, 1975). How- 
ever, some methodologists question this approach. Although many alterna- 
tive ranking schemes exist, this study used the procedure suggested by 
Mosteller and Wallace (1963) for ranking the discriminating dimensions.* 
Table 8 summarizes the relative importance rankings for the 13 analyses. 


Satisfaction with Planning Systems 


. For the grouping based on satisfaction with planning systems, the three 
top-ranked discriminators were resistance to planning, system capability, 
and resources provided for planning. Attention to internal facets ranked as 
the least important discriminating dimension. These findings suggest that 
satisfaction with planning is largely a function of its organizational context. 

On a univariate basis, the low-satisfaction group differed from the high- 
satisfaction group along all but one dimension: attention to external facets. 


‘In this procedure, the variables are ranked on the basis of the absolute value of the product 
bj (X ox p where bj is the unstandardized discriminant function coefficient of the variable j, 
Xa is the mean of this variable for group 1, and X — is the mean for group 2. For discussion of 
the superiority of this procedure over the conventional use of the standardized discriminant 
coefficients for ranking, see Mosteller and Wallace (1963) or Joy and Tollefson (1975). 
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However, in the discriminant analysis, attention to external facets emerged 
as the fourth most important discriminator. 

If use of techniques, attention to internal facets, and functional coverage 
can be considered to represent aspects of a planning process that tap its 
degree of formality, then it appears that formality of planning and satisfac- 
tion are unrelated or only weakly related. 

The implications of these findings for managers concerned with enhanc- 
ing satisfaction with planning would appear to be to pay specific attention to 
the organizational context of planning, particularly to any sources of resis- 
tance to planning, and to the level of resources committed to planning 
activities. Enhancing the external orientation of a planning system also 
appears to contribute to satisfaction with planning. Presumably, organiza- 
tional members perceive externally oriented planning systems as conducive 
to meaningful planning exercises and therefore such an orientation increases 
satisfaction with planning. Of course, these are conjectures subject to further 
empirical testing, since the results of discriminant analysis can only yield 
tentative clues as to causal linkages. 


Fulfillment of Objectives 


Turning to the analyses for the variables measuring fulfillment of 
objectives, system capability appears to be the key discriminator overall. It 
ranked first in four of the seven analyses of objective fulfillment. However, 
it ranked last in the analysis for short-term performance and sixth for long- 
term performance. Thus, system capability appears to be more related to the 
process-based as opposed to the outcome-based objectives of planning. Fur- 
ther support for this conjecture is provided in the fact that system capability 
ranked near the bottom in three of the five analyses based on the variables 
measuring relative performance. But it ranked second when the grouping for 
discriminant analysis was based on the performance composite (Table 8). 

Although resistance to planning never ranked as the top discriminator in 
the analyses of objective fulfillment, it never ranked lower than fourth, and 
in five analyses it ranked either second or third. Thus resistance to planning 
appears to be a key discriminator for fulfillment of objectives. 

Use of techniques, functional coverage, and resources provided for plan- 
ning ranked among the top three discriminators in three analyses each. Each 
ranked at the bottom in one. Although use of techniques was top-ranked in 
the analyses predicting future trends and short-term performance, it ranked 
last in that for enhancing management development. Thus, the contributions 
of techniques seem limited to the relatively more tangible objectives. Func- 
tional coverage ranked second in the analysis for enhancing management 
development and third in those predicting future trends and the objective 
composite. Why functional integration should improve prediction of future 
trends is not clear, but it is intuitively tenable that such integration can lead to 
management development. Functional emphasis and integration can result 
in the identification of key success factors and in developing a general man- 
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agement perspective. Such integration can also be expected to reduce conflict 
across functional boundaries. Resources provided for planning was found to 
be an important discriminator for the performance objectives and for stimu- 
lating a wide variety of alternatives and their evaluation. 

Attention to internal facets as well as attention to external facets appear to 
be the least important discriminators as far as the variables measuring fulfill- 
ment of objectives are concerned. However, attention to external facets ranked 
relatively high with reference to the performance objectives, and attention to 
internal facets achieved its highest ranking in the analyses for measures of 
avoiding problem areas and evaluating alternatives. 


Relative Performance 


Resources provided for planning emerged as the key discriminator in the 
analyses of relative performance. It ranked consistently as the first or second 
factor. In contrast, use of techniques had consistently low rankings. It ranked 
last in two analyses, namely those for growth in earnings and return on 
investment, and in no analysis did it rank higher than fourth. This finding 
lends support to recent criticisms of the technique orientation in contempo- 
rary management (Peters & Waterman, 1982), and particularly to the attacks 
agaiust popular planning techniques such as PIMS and BCG (Kiechel, 1982; 
Wensley, 1982). Techniques have a role to play in strategic planning, but the 
results of this study suggest that effective planning requires more than merely 
using sophisticated analytical techniques. 

The consistent and relatively low ranking of resistance to planning in 
the analyses for relative competitive performance is also an interesting finding. 
Early writings on planning placed considerable emphasis on the resistance 
issue. It is tempting to speculate that such emphasis, although appropriate in 
the early days of planning, is perhaps less critical today. Today planning has 
increasingly been recognized and accepted as a line manager's job (Business 
Week, 1984). The issue of introducing it to organizations and securing the 
commitment of managers may have simply become less crucial. This does 
not, however, miminize the need for managing organizational resistance to 
planning. 

Attention to external facets ranked at the top in the analyses for growth 
in earnings and changes in market share, and was third in the analysis for 
growth in sales. Thus, an external orientation appears to be important for 
effectiveness in relative competitive performance. However, attention to exter- 
nal facets was not a key discriminator for return on investment and the 
performance composite. 

Both attention to internal facets and functional coverage showed consid- 
erable instability in their rankings from analysis to analysis. The former's 
high ranking in the analysis for growth in sales is not easy to interpret; the 
high ranking of functional coverage in the analysis for return on investment 
is explainable as possibly due to the efficiency achieved by good functional 
coordination. 
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A Synthesis 


Considering all analyses together, system capability and resources pro- 
vided for planning each ranked at the top in four analyses. System capability 
tanked as one of the three top discriminators in eight analyses and resources 
in nine. System capability ranked at the bottom in only two analyses and 
resources in only one. Thus, overall there is little doubt that system capabil- 
ity and resources provided for planning are key dimensions when the effec- 
tiveness of planning systems is construed in the broadest sense. 

Although it ranked at the top in only one analysis, resistance to planning 
was never ranked at the bottom and was among the top three discriminators 
in six of the analyses. However, its discriminating influence was limited to 
the analyses of objective fulfillment and satisfaction. In contrast, functional 
coverage, which also ranked among the top three discriminators in six 
analyses, was found to be an important discriminator in three of the seven 
analyses of fulfillment of objectives and three of the five analyses of relative 
performance. We are therefore inclined to consider functional coverage to be 
a more important discriminator than resistance to planning overall. 


IMPLICATIONS 


The importance of the context of planning is underscored by the fact that 
each of the three criteria of effectiveness employed in this study appears to 
be influenced significantly by at least one of the two contextual dimensions. 
Both resources provided for planning and resistance to planning were key 
discriminators for satisfaction as well as objective fulfillment, whereas 
resources ranked consistently at the top for relative competitive performance. 
Future studies should include more contextual dimensions than the two 
used in this study. Possible candidates include environmental volatility, 
leadership styles, and maturity of planning systems. 

Among the design elements, system capability proved to be a powerful 
discriminator for satisfaction and objective fulfillment, but proved less impor- 
tant for relative performance—except for the performance composite. In 
contrast, attention to external facets was an important discriminator for rela- 
tive performance, but not for objective fulfillment. Functional coverage was 
an important discriminator for both objective fulfillment and relative 
performance. These results suggest that the importance of design elements 
varies considerably depending on the criterion of effectiveness used. The 
implication of this conclusion for systems design is that efforts to create 
supersystems capable of achieving numerous purposes at the same time are 
unlikely to be productive and that systems design should be tailored to 
specific purposes of planning. Trade-offs are inevitable if a planning sys- 
tem is to meet multiple aims (King, 1983). 

The implication of the overall importance of resources and functional 
coverage is straightforward. Effective planning requires an infusion of ade- 
quate resources to planning efforts. This in turn requires high levels of involve- 
ment on the parts of not only staff managers but also operating managers and 
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chief executive officers. As for functional coverage, a high score along this 
dimension would result from uniformly high responses on all the compo- 
nent items in this scale. This implies that functional integration and a gen- 
eral management perspective are crucial for effective planning. 

The finding that system capability emerged overall as the most impor- 
tant discriminator raises important questions about the roles of creativity 
and control as key considerations in the design of planning systems. Some 

' component items of this dimension tapped an orientation toward creativity 
and others tapped an orientation toward control. Although it is conventional 
to think of creativity and control as opposed orientations, our data suggested 
that the two tend to go together, as Peters and Waterman implied in their 
discussion of "simultaneous loose-tight properties" (1982: 318—325) in the 
context of organizational excellence. This was the reason we included aspects 
of creativity and control within the same dimension. For the present, we 
suggest that effective planning is achieved by a high degree of attention to 
both creativity and control, as opposed to trading the one off against the 
other. We are currently subjecting this hypothesis to further empirical analy- 
sis by examining interactive models of the creativity and control subdimen- 
sions of the system capability construct. 


CONCLUSIONS 


Most prior research on the effectiveness of planning systems has pur- 
sued the implicit question, ‘‘Does planning pay?” In contrast, this study was 
based on the premise that planning is a multidimensional and indistinguish- 
able feature of overall management processes, and hence we pursued the 
question, “What dimensions of planning are associated with effectiveness as 
approached from multiple perspectives?" 

The results of this study suggest that determinants of the effectiveness of 
planning systems tend to vary depending on the specific criterion of effec- 
tiveness used. Key dimensions overall seem to be system capability, resources 
provided for planning, and functional coverage. 

Every effort was made to use an appropriate methodology and to execute 
it as rigorously as possible. However, a key limitation of this study is our 
reliance on data from a single respondent in each firm. This may have intro- 
duced response or function bias to the data, and it may raise questions about 
the generalizability of our findings, since organizational constructs like these 
are probably best measured with data from multiple respondents drawn 
from different functional and hierarchical levels. Also, we cannot be sure 
there was no bias resulting from nonresponse and incomplete data. 

There are several possible extensions of this study. One line of inquiry 
would be to examine in detail the propositions suggested by the results 
summarized in Table 8. A second extension would be to examine the interre- 
lationships among the dimensions of planning systems, instead of treating 
them as separate entities as we did. Third, future research could consider the 
seven systems dimensions jointly to develop rich empirically based taxo- 
nomies of planning systems, and their implications for effectiveness could 
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be further explored. Lastly, future research could simultaneously analyze 
the relationships between the seven dimensions and the three criteria of 
effectiveness using approaches such as multivariate analysis of variance and 
canonical correlation analysis (Ramanujam & Venkatraman, 1985). 

We believe that research on planning systems is at a critical crossroads. . 
We hope this study will stimulate further efforts in new directions and 
enhance understanding of the role and value of planning systems in the 
formulation and implementation of strategies. 
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APPENDIX 


Responses for all items were measured on 5-point scales. Items followed by (R) were reverse- 
coded to indicate resistance. 


Dimensions of planning systems. System capability was measured on a scale ranging from 
"much improvement" to “much deterioration" with the following 12 items: ability to anticipate 
surprises and crises; flexibility to adapt to unanticipated changes; value as a mechanism for 
identifying new business opportunities; role in identifying key problem areas; value as a tool for 
managerial motivation; capacity to generate new ideas; ability to communicate top management's 
expectations down the line; value as a tool for management control; capacity to foster organiza- 
tional learning; ability to communicate line management's concerns to top management; value 
as a mechanism for integrating diverse functions and operations; and value as a basis for 
enhancing innovation. In addition, the following item, scaled from "strongly disagree" to 
"strongly agree," was included: "Today's system emphasizes creativity among managers more 
than our previous system."' 

Use of techniques was measured on a scale ranging from “significant decrease in use" to 
"significant increase in use" with the following nine items: portfolio (e.g., BCG) approaches, 
PIMS models, financial models, zero-based budgeting, value-based planning, project manage- 
ment techniques (e.g., PERT/CPM), stakeholder analysis, scenarios/delphi techniques, and fore- 
casting and trend analysis. 

Attention to internal facets was measured on a scale ranging from "significantly less 
emphasis" to "significantly more emphasis" with the following three items: internal capabilities, 
past performance, and reasons for past failures. 

Attention to external facets was measured on a scale ranging from “significantly less 
emphasis" to "significantly more emphasis" with the following five items: general economic 
and business conditions, regulatory issues, worldwide competitive trends, supplier trends, 
customers and end user preferences, and technological trends. 

Functional coverage was measured on a scale ranging from “significantly less emphasis" to 
"significantly more emphasis" with the following seven items: marketing function, operations 
and manufacturing function, finance function, personnel function, purchasing and procure- 
ment function, R&D and technology, and computers and MIS, 

Resources provided for planning was measured on a scale ranging from “significant decrease” 
to "significant increase" with the following four items: number of planners, time spent by the 
chief executive officer in strategic planning, involvement of staff managers in strategic planning, 
and resources provided for strategic planning. 

Resistance to planning was measured on a scale ranging from “significant decrease" to 
“significant increase" with the following five items: overall emphasis on strategic planning {R}, 
involvement of line managers in strategic planning (R), acceptance of the outputs of strategic 
planning exercise by top management (R), resistance to planning in general, and threats to the 
continuation of strategic planning. 


Effectiveness of planning systems. Fulfillment of objectives over the past five years was mea- 
sured on a scale ranging from “entirely unfulfilled” to “entirely fulfilled" with the following six 
items: predicting future trends, evaluating alternatives based on more relevant information, 
avoiding problem areas, enhancing management development, improvement in short-term 
performance, and improvement in long-term performance. 
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Performance relative to competition over the past five years was measured on a scale 
ranging from “much worse" to "much better" with the following four items: growth in sales, 
growth in earnings, changes in market share, and return on investment. 
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This study explored the effects of layoffs on survivors. We assessed 
subjects’ work performance as a function of whether a co-worker had 
been laid off and the circumstances of that layoff. Consistent with equity 
theory, subjects worked harder when they believed that a co-worker's 
dismissal was based on a random process rather than on the relative 
merits of their and their co-worker's prior performance. Data drawn 
from questionnaires lent further support to equity theory as an explana- 
tion of these results. 


Many organizations resort to layoffs during periods of financial strain in 
obvious attempts to reduce personnel costs. Less obvious though perhaps no 
less important are the likely consequences that layoffs have for the work 
behavior and attitudes of survivors, those employees who are not laid off. To 
date relatively little research has explored the effects of layoffs on survivors. 
The present study was designed to address this deficiency. 

Several psychological perspectives offer some insight into the effects of 
layoffs on survivors. For example, layoffs may elicit anxiety produced by job 
insecurity that may in turn influence the performance of survivors. Indeed, 
the relationship between anxiety and performance has long been a topic of 
interest to experimental psychologists (Spence, Farber, & McFann, 1956). 

More germane to the present study, the voluminous literature on distrib- 
utive justice (Deutsch, 1975; Folger & Greenberg, 1985; Leventhal, 1980; 
Tyler & Caine, 1984) may also elucidate the effects of layoffs on survivors. 
Distributive justice concerns the principles decision makers rely on in 
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allocating scarce resources. According to Deutsch and others, the relative 
salience of these principles depends on the goals that a group or organiza- 
tion is trying to achieve. In business organizations the primary goal is eco- 
nomic productivity; thus equity—trather than equality or need—is most likely 
to be the salient norm. Accordingly, the vast body of research on equity 
theory (Adams, 1965) may apply to the current analysis. 

Much of the previous research on equity theory in industrial and organi- 
zational psychology has evaluated the effects of positive inequity on workers’ 
reactions (Mowday, 1979). Positive inequity refers to workers’ perceiving that 
the ratio between outcome and input is higher for them than it is for relevant 
others. Two basic notions in equity theory are that positive inequity (1) 
arouses guilt, and (2) motivates individuals to redress this guilt through 
behavioral or psychological means. For example, Adams (1965) and others 
have shown that positive inequity may cause employees to work harder, 
presumably in order to redress the perception that they are taking advantage 
of their employers. 

It is our contention that layoffs may cause survivors to experience posi- 
tive inequity, especially if they believe that they could have just as easily 
been dismissed as their co-workers. Thus, contrary to the procedure employed 
in most studies of positive inequity, which create positive inequity by 
increasing workers’ outcomes in relation to those of comparison others, sur- 
viving a layoff may produce pasitive inequity by decreasing outcomes for 
comparison others. 

In fact, a recent study by Brockner, Davy, and Carter (1985) tested the 
hypothesis that the dismissal of a co-worker could cause survivors to experi- 
ence guilt produced by positive inequity that in turn would lead to improve- 
ment in subsequent work performance. In that laboratory experiment, each 
subject performed a clerical task and was led to believe that another person— 
who was actually an experimental accomplice—was simultaneously perform- 
ing a similar task. Subjects’ motivation for taking part in the study was to 
receive extra course credit. Midway through the proceedings, half of the 
subjects were told that an unforeseen problem made it impossible for both 
them and the other person to complete the experiment. Moreover, they were 
told that the person who did not complete the experiment would receive no 
credit for having taken part in the study. A seemingly random procedure 
always forced the accomplice ta leave while the subjects survived. These 
subjects, whom we designated as in the layoff condition, then performed the 
second half of the clerical task. 

The other half of the subjects, designated as in the control condition, 
were merely interrupted midway through the proceedings for a break. The 
primary dependent variables were (1) subjects’ perceptions of how fairly 
both they and the other person had been treated, (2) their guilt, and (3) their 
work performance, as indicated by the quantity of their productions and 
quality of their efforts on the first and second halves of the clerical task. 

The results were consistent with equity theory. Compared to those in the 
control condition, subjects in the layoff condition (1) reported that they had 
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been treated much more fairly than the other person, (2) reported feeling 
more guilty, and (3) increased the quantity—but not the quality—of their per- 
formance to a greater extent on the second half of the task. 

In this previous study (Brockner et al., 1985), decisions about whom to 
lay off were results of a seemingly random process. For practical as well as 
theoretical reasons, however, it seems important to study the effects of differ- 
ent rules for making layoff decisions on the work performance of survivors. 
At the practical level, managers faced with the difficult task of implementing 
layoffs will clearly be concerned with whether different decision rules elicit 
differential responses among survivors. If a particular decision rule causes 
survivors to feel and perform more positively, or at least less negatively, 
managers may wish to invoke that rule when layoffs are inevitable. At the 
theoretical level, equity theory provides reason to believe that decision rules 
significantly affect the subsequent performance of survivors. 

The present study was thus designed to extend the earlier study. We 
randomly assigned subjects to one of three experimental conditions. As in 
the previous study, we studied random layoff and control conditions. In 
addition, we assigned a third group of subjects to a merit layoff condition in 
which they were told that the decision about who was to stay was based on 
the relative merit of their performance on the first half of the task. We led 
subjects in the merit layoff condition to believe that they had outperformed 
an accomplice. 

Equity theory predicts that the merit layoff condition should have less 
effect on workers’ subsequent performance than will the random layoff 
condition. More specifically, subjects in the merit layoff condition may restore 
equity psychologically by perceiving that their better performance or greater 
input justified their status as survivors. Consequently, these survivors have 
less need to redress the inequity behaviorally through such means as height- 
ened work performance. In short, although we expected the quantity of 
survivors’ output to be greater in the random than in the control condition, 
we expected that there would be little or no difference in quantity of output 

. between those in the merit and control conditions. Instead, subjects in the 
merit condition would redress inequity by focusing on the fact that they had 
performed better than their co-worker on the previous task. 

The present study also addressed another important theoretical issue. At 
the outset we implied that guilt produced by positive inequity is one, but not 
the only, reaction that survivors may have to co-workers' layoffs. In actual 
organizations, survivors may have many other feelings. To name two 
possibilities, they may be worried or anxious that they too will be laid off at 
some future date, or angry, especially if they perceive the layoffs as 
illegitimate. Because the present study is derived from equity theory, it is 
important to isolate the effects of guilt produced by positive inequity from 
these other reactions of survivors. Thus, we induced the layoff conditions 
so as to arouse guilt produced by positive inequity but not other affective 
states like anxiety or anger. Of course, whether subjects in the layoff condi- 
tions actually felt anxious or angry in addition to feeling guilty is an empiri- 
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cal question. Accordingly, we assessed their affective states after the layoff 
manipulation. 


METHODS 
Subjects 


A total of 145 undergraduates at a large state university completed the 
experiment. We drew subjects from introductory psychology or management 
classes, offering them extra course credit for taking part. 


Procedures 


Upon arrival at the laboratory for what we called a test validation 
experiment, a subject was led into a large research room and seated at a desk. 
The experimenter told the individual that another subject had to arrive before 
the experiment would begin. Approximately 30 seconds later a woman 
confederate, posing as another subject, arrived and was seated at a desk 
diagonally opposite to the subject. Both subject and confederate were informed 
that they would (1) work on a proofreading task, (2) have a short break, and 
(3) work on a second proofreading task. The alleged purpose of the proofread- 
ing was to develop some standardized tests of cognitive performance. To 
minimize competition, we told both individuals that they would not be 
proofreading the same material. Furthermore, they were told that “to mini- 
mize any possible distractions we are going to have you do the proofreading 
in separate, private rooms.” 

At this point we ushered both confederate and subject into separate 
research cubicles to perform a proofreading task that has been used in previ- 
ous research (Reis & Burns, 1982). They received two six-page booklets con- 
taining a passage from Jane Jacobs's Death and Life of Great American Cities. 
One copy was designated uncorrected pages and had numerous spelling, 
grammatical, and punctuation errors throughout. The other, flawless copy 
was designated master pages. Subjects were instructed to proofread by com- 
paring the two and to underline but not correct all of the errors they observed 
on the uncorrected pages. They had eight minutes to read the passage. The 
experimenter emphasized the importance of both speed and accuracy to 
subjects; these provided indicators of quantity and quality of performance. 

The experimenter then left subjects alone in their research cubicles for 
eight minutes to perform the proofreading. After eight minutes had elapsed 
the experimenter reentered subjects' cubicles, had them underline the last 
line they had proofread, and asked them to follow her back to the large 
research room. The confederate, also fetched by the experimenter, came to 
the large research room several seconds later. During this rest period we 
introduced the layoff manipulation. Previously we had randomly assigned 
subjects to one of the three conditions. The experimenter told subjects in the 
random and merit layoff conditions that because of a room scheduling prob- 
lem only one person could finish the experiment; moreover, the person who 
left would receive no credit for having taken part in the study. Then the 
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experimenter stated an apparently impromptu basis on which the layoff 
decision was going to be made. 

The experimenter told subjects in the random layoff condition that both 
individuals would have to draw lots to determine which one of them would 
get to stay. We rigged the lottery so that subjects always were chosen to 
stay. For subjects in the merit layoff condition the experimenter said that 
the decision would be based on the quantity and quality of each individual’s 
performance on the first proofreading task. The experimenter then excused 
herself for a few moments, supposedly to grade the first proofreading tasks. 
Subjects were then always informed that they had performed better than the 
other person and therefore could stay while the confederate was forced to 
leave. 

To buttress the perceived unfairness of her having been forced to leave, 
the confederate said as she was exiting that it was really unfair to make 
her leave and deprive her of the extra credit. The experimenter verbally 
agreed that it was a tough break, but said nothing else. 

For the control condition the experimenter merely asked subjects not to 
speak with one another during the rest period. The experimenter then made 
busy work for herself, for about one minute, a period comparable to the time 
needed to complete the layoff for the random and merit layoff conditions. 

Subjects in all conditions were led back to their private research cubicles, 
in which they worked on a different six-page proofreading task taken from 
the same book used for the first task. For the control condition the confeder- 
ate was led back to her cubicle first, in full view of the subject. The instruc- 
tions and length of time allotted for the second task were the same as for the 
first. After eight minutes the experimenter reentered the subjects’ cubicles 
and asked them to underline the last line that they had proofread. 

Dependent variables. The behavioral dependent measures were mea- 
sures of the quantity and quality of proofreading performance on each task. 
We assessed the first by counting the numbers of lines that subjects had 
proofread during the eight-minute reading periods and determined the sec- 
ond by computing the percentages of errors that subjects had correctly identi- 
fied in the lines they had finished reading. 

After completing the second proofreading task all subjects responded to 
a final questionnaire, which included a manipulation check as well as a 
number of measures of their affective states while working on the second 
proofreading task. The two manipulation check questions required subjects 
to rate (1) how fairly the experimenter treated them (self-rating), and (2) how 
fairly the experimenter treated the other subject (other rating). Responses 
could range from 1 = notatallto 7 = more than fairly. Subjects also indicated 
the extent to which each of the following described how they felt while 
proofreading the second task: frustrated, worried, lucky, sad, confident, guilty, 
anxious, glad, competitive, angry, helpless, distracted, and sorry for the 
other person. Responses to each of these measures could range from 1 = not 
at all to 7 = a great deal. In addition, subjects responded to an item designed 
to tap their perceived inputs on the first proofreading task: “How well did 
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you perform on the first proofreading task?”’; responses could range from 1 = 
very poorly to 7 = very well . 

After they had completed this questionnaire the experimenter fully 
debriefed the subjects, paid and thanked them, and implored them not to 
speak with anyone about the research. 


RESULTS 
Manipulation Check 


Equity theory posits that perceived equity is the result of relative 
comparison—individuals’ comparing themselves to others. Thus, we employed 
an analysis of variance with two factors, condition x self-other, with repeated 
measures on the second factor. The relevant effect is the interaction, which 
was highly significant (F2? 145 = 14.15, p < .001). Subjects in the random layoff 
condition reported that they had been treated much more fairly than the 
other subject (X = 5.22 and 3.44), and subjects in the merit layoff condition 
did likewise (x = 5.34 and 3.63). But those in the control condition did not 
report that they had been treated more fairly (x = 6.08 and 5.79). The layoff 
induction thus appears to have been successful. Moreover, there was no 
evidence that subjects saw layoffs based on merit as less unfair than the 
random layoffs. What differed was how the subjects in the two layoff condi- 
tions responded to the perceived positive inequity. 


Behavioral Measures 


Quantity of performance. Subjects' performance on the proofreading 
task prior to the layoff manipulation did not differ as a function of experi- 
mental condition (F < 1). We then performed a two-factor (condition X task) 
analysis of variance with repeated measures on the second factor. The main 
effect for task was highly significant (F4,445 = 59.27, p < .001). Across 
conditions, subjects proofread many more lines in the second than the first 
task. This effect, which was not of primary importance, may have been due 
to a practice or warm-up effect. 

Of greater importance was the significance of the interaction between 
condition and task (F2 145 = 3.90, p < .025). As predicted and shown by Table 
1, the greatest rise in productivity between the first and second tasks occurred 
in the random layoff condition (X — 15.55 lines). By contrast, subjects in the 
control condition proofread 6.51 more lines on the second than the first task 
and those in the merit condition proofread 9.41 more lines on the second 
task. The results of a Newman-Keuls multiple comparison test revealed that 
the increase in the random condition was significantly greater than that 
exhibited in the control condition (p « .02), and in the merit condition (p « 
.06).. As predicted, there was no significant difference between the increases 
exhibited in the control and merit conditions (p > .25). The significant differ- 
ence between the random and control conditions thus replicates a previous 
research finding (Brockner et al.; 1985) whereas the comparisons between 
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TABLE 1 
Comparison of Productivity on Tasks for Three Experimental Conditions 





Mean Number of Lines Proofreadà 
Increase in Lines 


Experimental Conditions First Task ` Second Task Proofreadb 

Control 85.31 91.82 6.51 
(N = 39) (25.06) (27.21) 

Random layoff 87.67 103.22 15.55 
(N = 36) (27.34) (29.95) 

Merit layoff 84.07 93.48 9.41 
(N = 70) (25.72) (23.48) 


Maximum score = 164 lines; standard deviations in parentheses. 
bThe layoff manipulation took place between the first and second tasks. 


the merit and random layoff conditions and the merit and control conditions 
extend the earlier research. 

On the assumption that work performance is a function of motivation 
and ability, it may be useful to speculate about what mediated the significant 
results on the quantity measure. It is likely that the main effect for task 
reflected an increase in proofreading ability due to a practice effect from the 
first to the second task. The varied increase in work quantity across the three 
conditions, however, more likely reflected differences in motivation than in 
ability. 

Quality of performance. There were no differences in performance qual- 
ity on the initial task as a function of the upcoming layoff manipulation (F < 
1). We performed an analysis of variance for condition X task with repeated 
measures on the second factor. As in the analysis of the data on quantity, two 
significant effects emerged. First, the main effect for task was highly 
significant; subjects correctly identified fewer errors in the material in the 
second task (F1,142 = 24.50, p < .001). Second, and of greater importance, was 
the significant interaction between the factors (F2 445 = 4.89, p < .01). The 
greatest decline in quality between the first and the second tasks occurred 
for subjects in the random layoff condition (x = 63.50 and 53.72%); the least 
decline occurred for those in the merit layoff condition (X = 61.03 and 59.15%); 
and a moderate decline occurred in the control condition (X = 61.49 and 


1 Brockner and colleagues (1985) observed an increase in productivity under the random 
layoff condition particularly among subjects who were low in chronic self-esteem. In order to 
evaluate replicability, we had subjects complete the Janis-Field self-esteem scale (Eagly, 1967) 
at the outset of the present study. An ANOVA for self-esteem X condition X task yielded no 
significant effects for self-esteem, but the effect of self-esteem in the control and random layoff 
conditions was quite similar to that observed in the previous study. A simple analysis of effects 
revealed that people with low self-esteem increased the quantity of their performance more in 
the random than in the control condition (F, g9 = 5.31, p < .03). In marked contrast, in the random 
and control conditions people with medium and high self-esteem exhibited equal increases in 
productivity (Fs « 1). 
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56.23%). The Newman-Keuls multiple comparison test revealed that the 
change in quality of performance differed between the random and the merit 
layoff condition (p < .01), but that changes in the two layoff conditions did 
not differ significantly from the change exhibited in the control condition. 
Correlational analyses revealed that quantity and quality of performance 
were inversely related on both proofreading tasks, though somewhat more so 
on the second than the first (r443 = —.32 and —.56). In spite of these negative 
correlations, we should note that the data on change in quality between 
conditions were not simply a mirror image of the quantity results. Although 
subjects in the random conditicn exhibited both the sharpest decrease in 
quality and the sharpest increase in quantity, the condition in which quality 
of performance showed the least decline was the merit condition and the one 
in which the increase in quantity was the least was the control condition. 


Affective States 


Data from the manipulation check suggested that subjects in the layoff 
condition did perceive positive inequity in how they had been treated 
compared to their dismissed co-worker. Was this positive inequity associ- 
ated with heightened guilt or remorse as equity theory would predict? We 
found that it was. Subjects in the merit and random layoff conditions reported 
feeling more guilty (x = 2.84 and 2.36) while performing the second proof- 
reading task than did subjects in the control condition (x = 1.72; F2 442 = 4.65, 
p < .01). Furthermore, subjects reported feeling much more sorry for the other 
person in the merit and random layoff conditions (X — 4.81 and 5.03) than in 
the control condition (x = 1.74: F2145 = 35.73, p < .001). 

Although this evidence suggests that surviving a layoff evoked feelings 
of guilt or remorse, it does not rule out the possibility that the layoffs aroused 
other feelings unrelated to equity theory. For example, subjects in either of 
the layoff conditions could have felt worried or anxious about the possibilitv 
of their also being asked to leave, or angry. It is important, however, that 
subjects in the random and merit conditions did not report feeling more 
worried (F < 1), anxious (F < 1), or angry (F < 1) than did subjects in the control 
condition. In fact, the layoff manipulation had no significant effect on subjects’ 
feeling frustrated, lucky, confident, competitive, sad, helpless, or distracted, 
according to self-reports. The only other significant effect emerged for feeling 
glad (F2 145 = 5.22, p < .01). Not surprisingly, subjects in the merit and random 
layoff conditions reported feeling more glad (x = 4.51 and 4.06) than did 
those in the control condition (x — 3.49). 


Perceived Input 


The results for the manipulation check and measures of affective states 
revealed that subjects in the random and merit conditions reported feeling 
relatively equal levels of positive inequity, guilt, and remorse. Yet only those 
in the random condition significantly increased productivity. We predicted 
that those in the merit condition would be able to redress the positive ineq- 
uity through the notion of their greater input. They would justify their survi- 
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vor status by focusing on the fact that their initial performance or the task 
was better. In fact, subjects in the merit condition did evaluate their initial 
task performance more favorably (X = 5.09) than did those in the random (x = 
4.31) and control conditions (X = 4.49; F2 145 = 4.42, p< .01). Of course, such 
results do not prove that survivors in the merit condition redressed the 
inequity psychologically, but they are at least consistent with such an 
assertion. 


DISCUSSION 


In summary, data revealed that subjects saw the random and merit layoffs 
as more unfair to the other subject and that these layoffs aroused guilt and 
remorse. Perhaps most interesting, subjects appeared to redress the positive 
inequity in very different ways in the two layoff conditions. Random layoffs 
caused subjects to increase the quantity of their output; merit layoffs led 
subjects to perceive that they had performed more favorably on the first task 
than did their counterparts in the control or random conditions. This sug- 
gests that the merit survivors restored equity through psychological rather 
than behavioral means. Finally, the quality of survivors' performance was 
significantly greater in the merit than the random condition. 


Explanations 


The present study was generally derived from distributive justice theory 
(Deutsch, 1975) and grounded more specifically in equity theory (Adams, 
1965), Taken together, the three findings of the present study provide strong 
support for equity theory. 

First, data from the manipulation check revealed that subjects perceived 
their co-worker's dismissal to be unfair; in other words, subjects reported 
experiencing positive inequity as a result of a comparison with the worker 
who was laid off. 

Second, the results of the measures of affective states demonstrated that 
subjects in the merit and random layoff conditions reported feeling guilty 
and remorseful, both believed to be emotional consequences of positive 
inequity. 

Third, equity theory posits that inequity can be redressed through both 
behavioral and psychological means. Once again, our evidence supports the 
theory. Survivors in the random condition exhibited a greater increase in 
subsequent performance input than did those in the merit and control 
conditions. Survivors in the merit condition perceived their prior perfor- 
mance input as greater than did survivors in the other two conditions. — 

Alternative interpretations. Not only do the findings reported in the 
preceding paragraphs support equity theory, but other findings—primarily 
those drawn from the measures of affective states—are inconsistent with 
alternative interpretations. Thus, surviving layoff could have increased 
anxiety among subjects; or in a more positive vein, it could have increased 
confidence. Either could have affected their subsequent work performance. 
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For example, survivors in the random condition may have increased the 
quantity of their output to the extent that they experienced worry or anxiety 
produced by job insecurity. However, there was absolutely no evidence that 
subjects in either the random or merit conditions felt worry, anxiety, 
confidence, or any of a variety of emotions more acutely than did subjects in 
the control condition. 
The present findings extend the range of organizational phenomena to 
‘which equity theory is relevant. One frequent criticism of previous equity 
studies is that researchers have relied too much on overpayment to induce 
positive inequity. In actual organizations a variety of factors, not all having 
to do with money, can stimulate inequity. This study is one of the few 
attempts to manipulate inequity through nonfinancial means (cf. Greenberg & 
Ornstein, 1983). 


Future Research 


To reiterate, only a few earlier studies (e. g., Greenhalgh, 1983) have 
empirically investigated the impact of layoffs on survivors. We hope that the 
results reported here will stimulate further work on this topic of practical 
and theoretical significance. To increase the external validity of such future 
research efforts, investigators should study the effects of layoffs in actual 
organizations. It is likely that actual layoffs evoke many affective states 
among survivors—anxiety, guilt, anger, and relief, to name a few—that may 
influence their work behaviors and attitudes in complex ways. Investigators 
need to give careful thought to the factors that moderate the effects of layoffs 
on survivors. The results reported here suggest that the rule by which layoff 
decisions are made is one relevant factor; but there are countless others. For 
example, layoffs are bound to have more emotional impact when (1) prior 
work interdependence between the laid-off workers and survivors is high, 
(2) the survivors and laid-off workers are similar professionally or personally, 
and (3) survivors cannot easily find comparable work elsewhere. In sum, we 
suspect that many types of factors moderate the effects of layoffs on survivors. 
These include characteristics of the work itself, such as interdependence; 
individual differences among survivors, such as differences in self-esteem; 
formal organizational arrangements like the decision rule upon which layoffs 
are based; informal factors like similarity of co-workers; and environmental 
conditions such as the job market for comparable work. Of course, research 
exploring the effects of these factors should be theoretically grounded, as 
was the present study, in which we derived the decision rule variable from 
equity theory. . 

In addition, future investigators should assess the effects of several 
factors that are often present in actual organizations but that did not exist in 
our experimental setting. For example, the longevity of the performance 
effect obtained in the present study remains to be empirically evaluated. In 

.addition, the effects of anticipated layoffs are worthy of attention. Once 
again, we emphasize that it is important to study the impact of layoffs on 
survivors in more naturalistic settings. If anything, the laboratory procedure 
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used in this study, in which relatively little was at stake and in which 
survivors and laid-off workers were unacquainted, may grossly underesti- 
mate the impact of layoffs on survivors in actual organizations. 
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This study investigated the determinants of perceived ease of movement 
within an organization among two groups of clerical employees. It also 
investigated the relationship between perceived ease of movement and 
both organizational commitment and intention to quit. Younger employ- 
ees with shorter job tenure who felt that their personal characteristics 
matched organizational criteria for mobility and who received feed- 
back from their supervisors perceived themselves as having the greatest 
opportunities for advancement. Employees who desired mobility but 
perceived no opportunities were less committed to their organizations 
than employees who perceived they had opportunities for advancement. 


In one model of organizational careers (Milkovich, Anderson, & Green- 
halgh, 1976), “the career construct is represented in two forms, actual and 
` perceptual" (1976: 20). Actual careers have the “dual properties of (1) a 
patterned sequence of positions usually related in work content and (2) an 
ordered movement of individuals among these positions" (1976: 20). Per- 
ceived opportunity is an employee's interpretation of actual careers. Although 
there have been numerous studies on actual careers and mobility, perceived 
opportunities has been a neglected variable in empirical research. This con- 
cept has, however, been addressed from a theoretical perspective. Perceived 
ease of movement, which refers to employees' assessments of their opportu- 
nities to find better jobs, is a central factor in March and Simon's (1958) 
model of behavior at work. According to their theory, dissatisfied employees 
will explore alternative situations. If they see that alternatives exist —in other 
words, perceived ease of movement is high—these employees may leave their 
present situations. If perceived ease of movement is low because alternatives 
are not available, employees may adjust their aspirations or contributions to 
their organizations downward. 
March and Simon (1958) were primarily interested in determinants of 
perceived ease of movement outside an organization. The focus of this study 
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is perceived ease of movement inside an organization because employees 
-are just as likely to search for alternatives internally. In fact, March and 
Simon stated that whether there are opportunities within an organization 
will be related to an employee’s desire to leave. Employees may have only 
vague ideas about external opportunities but can estimate internal availabil- 
ity from “vacancy bulletins, awareness of frequency and mode of filling jobs, 
and extent of contact with supervisors” (Smith, 1979: 365). 

Perceived opportunity for intraorganizational mobility has implications 
for work motivation and attitudes. Lawler (1971) argued that employees will 
only work hard to get promoted if they perceive that appropriate positions 
are available and that promotion decisions are based on work performance. 
Studies have found that employees who perceive few opportunities for 
advancement have negative attitudes towards work and their organizations 
(Kanter, 1977; Kipnis, 1964; Larson, 1982). 

The twofold purpose of this study was (1) to investigate the determi- 
nants of perceived ease of movement, and (2) to examine the relationship 
between perceived ease of movement and attitudes towards organizations. 
Two groups of clerical employees were studied. We chose to study the careers 
of clerical employees because the recent revolution in office technology and 
current shortages of qualified clerical help in some areas of the country 
(Doudna, 1981) have spawned a debate over the careers of office personnel. 

Traditionally, the context of clerical jobs has been viewed as “an 
organized device to get work done with a minimum of disruption” (Cassell, 
Doctors, & Director, 1982: 8-9) rather than as an opportunity structure. 
However, Kanter (1977) found that career opportunity at all hierarchical 
levels can account for the way people involve themselves in their work. 
Dramatic increases in opportunity may heighten career aspirations, work 
commitment, and a sense of organizational responsibility for the rank-and- 
file, as well as for management. Career paths, patterns, and expectations may 
be different at low and high levels of a hierarchy, but lack of opportunity 
may lead to the same outcomes. This study was an exploratory attempt to 
look at the relationships between career expectations, experiences, and 
attitudes. 


HYPOTHESES 
Predictors of Perceived Ease of Movement 


Three previous models offer suggestions for determinants of perceived 
ease of movement within an.organization. These are Vardi’s (1980) and 
Anderson, Milkovich, and Tsui’s (1981) models of intraorganizational 
mobility, and March and Simon’s (1958) model of determinants of perceived 
ease of movement outside an organization. All three models include per- 
sonal characteristics as factors influencing expectations about mobility either 
directly or indirectly. Personal characteristics can be related to perceived 
ease of movement in two ways. First, individuals have profiles including 
characteristics such as gender, age, and socioeconomic status that partially 
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determine their attitudes towards work (Vardi, 1978). Second, "euch attri- 
butes as age, seniority and skill level serve as criteria for mobility when 
decisions to transfer, promote or demote employees are made by management" 
(Vardi, 1978: 48). We chose to examine two characteristics that contribute to 
an individual’s profile: age and job tenure. Previous research has found 
negative relationships between age and expectations of internal mobility 
(Moore, Miller, & Fossum, 1974; White, 1974). Three possible explanations 
have been given for this finding. First, age may be an organizational crite- 
rion for mobility; second, as people grow older they may become more 
realistic about the probability of attaining career goals; and third, people 
may become more cynical as they grow older (Moore, Miller, & Fossum, 1974). 

Research on job tenure and plateauing among managers has shown that 
employees who remain in the same jobs for a number of years are less 
satisfied with their careers than those who are more mobile (Veiga, 1981). As 
job tenure increases, unrealistic expectations of rapid promotions are replaced 
with more accurate perceptions of career opportunities based on experience 
in searching for better positions. The longer people are unsuccessful in 
advancing within their organizations, the lower their expectations about 
career mobility will be. 

Hypothesis 1: Age and job tenure will be negatively related 
to perceived ease of movement. 

The match between perceived organizational criteria for mobility and 
personal characteristics should also influence perceived ease of movement. 
Anderson and colleagues’ (1981) model of intraorganizational mobility 
included no direct link between individual characteristics and either actual 
or expected mobility. Instead, the match between individual characteristics 
and criteria for mobility explained both actual experiences of mobility and 
individuals’ expectations for future mobility. Past research has inferred such 
a relationship but has not examined it directly. 

In this study the variable of interest was perceived organizational cri- 
teria for mobility rather than actual criteria, although the two should be 
correlated. The match between actual criteria and individual characteristics 
will influence actual mobility, but it is the match between perceived criteria 
for mobility and individual characteristics that will influence perceived ease 
of movement. Actual criteria for mobility can be grouped into two categories: 
ability and nonability (Beehr, Taber, & Walsh, 1980). Nonability factors such 
as age, race, and gender are not bona fide occupational qualifications for most 
positions, but employees may perceive them as criteria for promotion. In 
particular, employees who have not been promoted may have such percep- 
tions, since people tend to attribute failure to factors beyond their control 
(Beehr et al.). Also, studies have shown that women, black people, and older 
workers experience less career progress than men, white people, and young- 
er workers (Elliott, 1966; Garbin & Bellweg, 1965; Grusky, 1966; Mahoney & 
Milkovich, 1982; Raudabough, 1974; Rosenbaum, 1979; Vardi, 1978). 

Ability factors such as education, skills, and performance are usually 
bona fide occupational qualifications on the basis of which management 
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largely makes or at least justifies decisions about promotions. Previous 
research has shown education to be a predictor of actual mobility (Dalton, 
1951; Grusky, 1966; Raudabough, 1974; Rosenbaum, 1979; Sorenson & Tuma, 
1981; Stone, 1953). Studies have also found education to be positively related 
to career aspirations (Hall, 1976; Moore et al., 1974; White, 1974). No one has 
empirically examined the relationships between employees’ perceptions of 
their own skills and performance and perceived ease of movement. However, 
it is likely that if employees believe job performance to be a criterion for 
mobility, those who perceive their performance as above average will per- 
ceive themselves as having greater ease of movement than those who do not 
perceive their performance as above average. 

Hypothesis 2: The match between perceived organizational 

criteria for mobility and personal characteristics will be 

positively related to perceived ease of movement. 

A perceived match between personal characteristics and organizational 
criteria will be irrelevant if there are no vacancies within an organization or 
if all vacancies are filled externally. Vardi (1980) suggested that employees 
will have a greater chance to move ahead when organizations have a policy 
of filling vacancies from within rather than.through an external labor market. 
Anderson and colleagues (1981) stated that if job vacancies are to be trans- 
lated into mobility, it must be assumed that those positions are filled from 
within an organization. We therefore propose that: 

Hypothesis 3: Perceptions that an organization has a pol- 
icy of filling vacancies from within will contribute posi- 
tively to perceived ease of movement. 

In March and Simon’s (1958) model, people’s visibility to other organiza- 
tions and the visibility of other organizations to them are factors influencing 
perceived ease of movement. We argue that visibility is an important factor 
in perceived intraorganizational movement as well. In particular, we believe 
that employees’ existing jobs can contribute to visibility by allowing them to 
move around in an organization and discover what jobs are available and by 
providing opportunities for them to develop skills appropriate for other jobs. 
Moreover, employees’ jobs determine their visibility to those responsible for 
hiring. Jobs with certain characteristics should thus facilitate the joint visibil- 
ity of employees and new jobs; these are (1) automony accompanied by 
responsibility, (2) contact with other people, and (3) task variety. 

White (1974) claimed that autonomy is a prerequisite for anyone who 
plans to stay visible as a job candidate because autonomous employees are 
held responsible for work outcomes, significant accomplishments can be 
attributed to them, and they can thus demonstrate potential success in posi- 
tions of higher responsibility. Using similar reasoning, Kanter (1977) argued 
against routinized jobs as mobility facilitators. 

The amount of contact employees have with organization members out- 
side their own areas should be related to both number of visible job alterna- 
tives and their own visibility to those who hire for the jobs. However, organi- 
zational policies such as regular postings of job openings in designated areas 
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or listings of vacancies in company newsletters may decrease the need to 
obtain such information from other people. 

Task variety may increase visibility by enabling employees to show 
broad ranges of accomplishments, demonstrating that they have skills neces- 
sary to perform other jobs. As Grandjean and Taylor (1980) argued, “underuse 
of skills and abilities prevents workers from demonstrating readiness for 
more responsible, challenging, and renumerative work” (1980: 40). 

Hypothesis 4: Autonomy, variety, feedback from other 
people, opportunities to deal with others, and opportuni- 
ties to learn on a job will be positively related to per- 
ceived ease of movement. 

Actual experience of mobility should also influence people’s expecta- 
tions of future mobility (Anderson et al., 1981). However, such a relationship 
is less likely among lower level employees than among managers because for 
the first group career ladders are shorter. Thus, they are likely to reach 
plateaus much more rapidly than managers. For employees who have reached 
plateaus, actual experiences of mobility may have a positive or a negative 
relationship to perceived ease of movement, or no relationship; it depends 
on an individual’s position on a career ladder. Although we state no formal 
hypothesis regarding the relationship between actual past mobility and per- 
ceived ease of movement, we will examine it. 


Outcomes of Low Perceived Ease of Movement 


Several studies have found that actual experiences of mobility are related 

to attitudes toward jobs and organizations (Bray, Campbell, & Grant, 1974; 
Grusky, 1966; Martin, 1979; Vardi, 1980; Veiga, 1981). Perceptions of future 
opportunities for mobility can also be expected to influence job related 
attitudes, since. past mobility is not related to future opportunities for mobil- 
ity for those employees who have plateaued or reached the tops of their 
career ladders. If employees who are interested in career advancement do 
not perceive that opportunities are available within their organizations, they 
will become dissatisfied and search for alternatives outside. They will com- 
pare these alternatives to their present situations and if comparisons favor 
the alternatives, they will leave (March & Simon, 1958; Mobley, 1977). If 
comparisons do not favor the alternatives, they may react in several ways. 
They may continue searching with intentions of quitting as soon as they do 
find acceptable alternatives. They may become less committed to their or- 
ganizations. Or they may react to lack of alternatives by lowering their aspi- 
tations and desires; previous studies of lower-level employees have found 
this outcome (Guest, 1954; Kanter, 1977; Vardi, 1980). 

Hypothesis 5: Perceived ease of movement will be posi- 

tively related to organization commitment and negatively 

related to intention to quit for those employees who desire 

mobility, but not for those employees who report no desire 

for mobility. 
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METHODS 
Respondents and Organizations 


This study was conducted in the summer in 1982 in two different types 
of organizations, a university and a state agency, both located in the north- 
eastern United States. The respondent population for the university was 
drawn from the 1981-82 staff directory. The study included all personnel in 
the following self-reported job categories: personnel assistant, office assistant, 
editorial assistant, typist, administrative aide, administrative secretary, 
secretary, administrative clerk, stenographer, word-processing operator, word- 
processing supervisor, secretary, and department secretary. Of the 1,176 
questionnaires sent out, 300 usable questionnaires were returned for a 
response rate of 27 percent. The respondents included 6 men and 16 individ- 
uals from minority groups. Their average age was 35; their average tenure was 
eight years; and 83 percent did not have four-year college degrees. 

Questionnaires at the state agency were distributed to all clerical staff in 
the following job categories: stenographers, typists, file clerks, clerks, account 
clerks, mail and supply clerks, statistical clerks, data entry machine operators, 
and dictaphone machine transcribers. Of the 1,600 questionnaires distributed, 
372 were returned for a response rate of 23 percent. These respondents 
included 25 men and 20 individuals from minorities. Their average age was 
37; their average tenure was eight years; and 92 percent did not have four- 
year college degrees. The two sets of respondents differed in some respects. 
Most of the clerical employees at the agency belonged to the Civil Service 
Employees' Association. Respondents at the state agency were older and had 
greater tenure than respondents at the university; respondents from the uni- 
versity were more highly educated. We gave data from respondents on race, 
gender, age, job tenure, education, job title, and job level to personnel repre- 
sentatives at both sites, who indicated that the respondents were representa- 
tive of their respective populations. 

The career development system at the university was employee-driven; 
individual employees had sole responsibility for initiating transfers. Employ- 
ees who had been in their current positions at least nine months were eligi- 
ble to apply for other jobs. Two campus publications posted position open- 
ings weekly. Staffing services reviewed all applications and referred qualified 
employees to the departments that were hiring. The university had a policy 
that employee candidates were to be given preference over external candi- 
dates. The departments that were hiring were responsible for giving employ- 
ees notice of any action taken, and they conducted interviews and made 
decisions independently. 

The state job-transfer system was also employee-driven but was some- 
what more centralized than the university's process because of the civil 
service exam system. The promotion process included four separate steps: 
eligibility to compete, competitive examination, canvas, and selection (Center 
tor Women in Government, 1981). Employees could obtain exam announce- 
ments and promotion application cards from the agency personnel office. 
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Candidates who qualified to take exams and who passed got numerical 
scores and were ranked from highest to lowest on master examination lists. 
Points were added to scores for seniority. When vacancies occurred, candi- 
dates who were on appropriate lists would be asked by the people doing the 
hiring if they were interested in vacant positions and a selection would be: 
made from the top three candidates who expressed interest. Employees would 
usually remain on lists for one to two years and then would have to take the 
exams again. The same exams applied to all agencies; however, each agency 
had its own list and would consider employees from that list when vacan- 
cies occurred. Therefore, for clerical employees, transfers from one agency to 
another were very unusual. 


Procedures 


The procedure for distributing the questionnaires differed for the two 
sites. At the university, the personnel department granted permission to 
conduct the study with a provision that questionnaires be sent to employees' 
home addresses and completed on their own time. A cover letter accompa- 
nying the questionnaire explained that the study was research for a Ph.D. 
dissertation on intraorganizational career mobility and the researchers were 
not acting as consultants. We instructed individuals to return the completed 
questionnaires directly to us in prestamped envelopes that we provided. At 
the state agency we conducted the study with the approval of both the civil 
service and the agency. The personnel department distributed question- 
naires and cover letters to supervisors, who distributed the questionnaires 
to their staff. Employees were given the same explanation for the study and 
instructed to complete the questionnaires during working hours and re- 
turn them directly to us in the prepaid envelopes. The agency included an 
additional cover letter explaining that the purpose of the study was research 
on career mobility and differentiating it from other studies being conducted 
at the same time. 


Measurement Instruments 


Background information on salary scales, career ladders, and internal 
recruiting and selection procedures was obtained from interviews with per- 
sonnel staff at both sites. Responses to the questionnaire referred to in the 
preceding section provided all measures of the variables included in the 
hypotheses. The questionnaire was eleven pages long. To the extent possible, 
we used scales that previous studies have validated. Table 2 lists alpha 
coefficients for all scales. 

Job characteristics. Job characteristics were measured by four of Hack- 
man and Oldham's (1980) scales: autonomy, feedback from agents, dealing 
with others, and skill variety; and one of Rousseau's (1977) scales, opportu- 
nity to learn. The solution of a factor analysis of the items supported the a 
priori scales with one exception: opportunity to learn and skill variety loaded 
on the same factor. Therefore, we combined these two scales. 
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Organizational commitment. This was measured using the nine-item, 
7-point response scale developed by Mowday, Steers, and Porter (1979). A 
high score on this scale indicates a high level of commitment. 

Intention te quit, perceived organizational policy on filling vacancies, 
desire for mobility, and perceived ease of movement. These variables were 
all measured by multi-item scales created specifically for this study. The 
Appendix gives response formats and items for each scale. Since previous 
researchers have argued that intention to remain within an organization is 
one component of commitment (Katz & Kahn, 1966), we performed a factor 
analysis on the intention-to-quit and commitment items. Results showed 
two distinct factors. Items for the remaining three scales mentioned above 
were also factor analyzed; results showed three distinct factors, which sup- 
ported the a priori categorization of items. 

Perceived match. The match between employees’ perceptions of organiza- 
tional criteria for mobility and personal characteristics was measured by 
comparing responses to items on personal characteristics such as age, gender, 
and level of job performance with responses to items concerning the impor- 
tance of specific criteria for mobility. The Appendix includes details on 
construction of this scale. 

Rate of mobility. A work-history table adapted from Vardi and Hammer 
(1977) was the source for actual information on mobility. We asked employ- 
ees to record dates, job titles, and jab grades for each job they had held since 
joining their organizations. Frequency and direction of job moves were com- 
puted by using job grade information, as explained in the Appendix. 

Other measures. Employees were also asked whether they were satisfied 
with their jobs, whether they would take courses to improve their job 
opportunities, and what position they wanted to be promoted to if they 
wanted to be promoted. These were all single items. 


ANALYSES AND RESULTS 
Career Experiences and Aspirations 


Data on career experiences and aspirations provide the context for inter- 
preting the results of the study. Table 1 shows means, medians, standard 
deviations, and t-values for different mobility measures for the two sites. 
Results show that rates of overall and upward mobility were greater at the 
university than at the state agency. Lateral moves were more common at the 
state agency. Thirty percent of the respondents at the university and 37 
percent of the respondents at the agency were still in their first jobs. These 
mobility rates seem low compared to the mobility rates for blue-collar workers 
of 27 to 73 percent, depending on the company, found by Vardi and Hammer 
(1977). However, low average job tenure at both sites could partially explain 
this discrepancy. 

Results also showed that clerical employees were very interested in 
career advancement. Only 24 percent of university respondents and 18 per- 
cent of state agency respondents reported no interest in moving to better 
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TABLE 1 
Comparisons of Actual Mobility for Both Groups 
DE —— ————— 
University State Agency 

Measures Medians Means s.d. Medians Means s.d. Values oft 
Neie ee ee ee ee 
Total mobility 20 .26 .29 At 19 25 3.26* 
Total mobility 

minus demotions 18 25 29 0 , 48 .23 3.32* 
Lateral mobility 0 20 .11 0 .07 46 —2.94* 
Upward mobility 17 .05 .27 0 11 18 5.48* 
Demotions 0 Ot .06 0 01 07 — 484 

*p < .05 


jobs. Most respondents—96 percent in the university and 92 percent in the 
agency—said they would be willing to take courses to improve their career 
. opportunities. Employees did not seek mobility because of dissatisfaction 
with their current jobs; 71 percent of university respondents and 58 percent 
of state agency respondents said they liked their jobs. 


Tests of Hypotheses 1 through 4 


Table 2 shows descriptive statistics and the intercorrelation matrix for 
all variables for both sites. We computed t-tests to examine differences 
between the two sites. Results showed that agency respondents reported less 
task variety, less autonomy, less interaction with others, greater desire for 
mobility, and greater intention to quit than respondents at the university, 

Table 2 also shows that results from both groups supported Hypotheses 
1, 2, and 3. Age and job tenure were negatively related to perceived ease of , 
movement. Two variables, (1) the match between perceived criteria for mobil- 
ity and personal characteristics, and (2) perceived organizational policy of 
filling vacancies from within, were positively related to perceived ease of 
movement. Results only partially supported Hypothesis 4. Feedback was 
positively related to perceived ease of movement for both groups. However, 
none of the other job characteristics were related to perceived ease of move- 
ment for the agency respondents. For the university respondents, autonomy 
and variety were positively related to perceived ease of movement. 

In addition, Table 2 shows that some of the variables hypothesized to be 
correlated with perceived ease of movement were correlated with each other. 
Age was positively correlated with job tenure and negatively correlated with 
the match between perceived organizational criteria for mobility and per- 
sonal characteristics. Job tenure was also negatively correlated with the match 
variable. Perceived organizational policy of filling vacancies from- within 
was positively correlated with feedback, autonomy, and variety. 

In order to determine the unique contribution of each variable in explain- 
ing the variance in perceived ease of movement, we used multiple regression 
analysis. Independent variables were age, job tenure, perceived organiza- 
tional policy of filling vacancies from within, match between perceived 
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TABLE 3 
Results of Regression of Perceived Ease of Movement on Age, Job Tenure, 
Perceived Policy on Vacancies, Match, and Four Job Characteristics 








University State Agency 
a Betas o SE uL o orn 
Rate of mobility 02 0.16 —.07 2.02* 
Age —.22 12.57* —.16 8.57* 
Job tenure -—.12 3.77* 30 28.36* 
Perceived policy 

on vacancies .08 1.89 > 11 4.80* 
Perceived match 21 14.14* —.14 7.04* 
Feedback 47 7.49* .18 9.96* 
Dealing with others —.08 1.89 Dé .50 
Autonomy .01 0.02 .00 .00 
Variety 14 4.09* —.06 .65 
R? .28 .27 
Adjusted R? .25 .25 

*p«.05 


organizational criteria for mobility and personal characteristics, the four job 
characteristics, and rate of mobility. Table 3 shows the results of the analyses 
for both groups. Controlling for the other variables, we found that age and job 
tenure were negatively associated, and feedback and the match between 
' perceived organizational criteria for mobility and personal characteristics 
were positively associated with perceived ease of movement for both sites.! 
For the university respondents, variety was positively related to perceived 
ease of movement. For the agency respondents, perceived organizational 
policy of filling vacancies from within was positively related to perceived 
ease of movement. For the state agency respondents, rate of mobility also 
was positively associated with perceived ease of movement in the regression 
analysis; however, the zero-order correlation between the two variables was 
not significant. For the university respondents, rate of mobility was not 
associated with perceived ease of movement once other variables were 
controlled. 

For both groups of respondents the adjusted R? was .25. To determine 
whether results were population specific, we used a Chow test (Kennedy, 
1979) to examine whether the overall equations for the two sites differed 
significantly. We also compared specific parameters for the two equations by 
running a regression for data from the two sites combined, including site as a 
dummy variable and eight interaction terms—site times each of the other 


1 Since age was highly correlated with perceived ease of movement, and age was included 
in the perceived match scale, there was some concern that the effects of age alone were account- 
ing for the relationship between the perceived match and perceived ease of movement. Thus, 
we regressed perceived ease of movement an the independent variables, with age removed from 
the perceived match scale. The results were identical to the first results. 
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independent variables (Cohen & Cohen , 1975).? There were no significant 
differences between the overall His or any of the parameters except for vari- 
ety on the job. Variety on the job contributed positively to perceived ease of 
movement for the university respondents, but not for the agency respondents. 
With the two groups combined, neither rate of mobility nor the interaction of 
rate of mobility with site were significant. Therefore, the negative relation- 
ship between rate of mobility and perceived ease of movement for the agency 
respondents may only be a statistical artifact. 


Test of Hypothesis 5 


It was hypothesized that desire for mobility would moderate the relation- 
ship between perceived ease of movement and both organizational commit- 
ment and intention to quit. Following Arnold (1982), we used subgroup 
analysis rather than moderated regression to test this hypothesis. Employees 
were divided into two groups: those who desired mobility, as indicated by 
scores above the median on the scale measuring desire for mobility, and 
those who were not very interested in moving from their present position, as 
indicated by scores below the median on the scale measuring desire for 
mobility. We then used Fisher's Z transformation to compare correlation 
coefficients between the two subgroups (Arnold 1982). Table 4 shows results. 
Perceived ease of movement was negatively related to intention to quit and 
positively related to organizational commitment only for those employees 
who scored high on the scale measuring desire for mobility. However, for the 
university respondents, the difference between subgroups for intention to 
quit was not significant. Therefore, Hypothesis 5 received only partial support. 


TABLE 4 
Subgroup Analysis of Perceived Ease of Movement 
with Organizational Commitment and Intention to Quit 





Great Desire Less Desire 
Variables for Mobility? z for Mobility? z z 
(a) University 
Organizational commitment .24* -245 .03 .030 1.69* 
Intention to quit —.14* .141 —.09 .090 Al 
(b) State Agency 
Organizational commitment —.27* 277 .09 .090 1.76* 
Intention to quit —.22* .224 —.04 .040 1.74* 
aN = 137 
DN = 128 
SN = 175 
aN = 184 
*p«.05 


? Cohen and Cohen (1975) made a distinction between using regression equations for predic- 
tion versus explanation. When researchers are interested in using an equation for prediction, 
they must cross-validate results. When the purpose of a study is explanation, as in this case, the 
significance of the differences between betas should be tested. 
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Table 4 shows that several of the variables hypothesized to be determi- 
nants of perceived ease of movement were related to commitment and inten- 
tion to quit. To investigate whether the relationship between perceived ease 
of movement and each of these two variables was spurious, we regressed 
both intention to quit and organizational commitment on perceived ease of 
movement and its hypothesized determinants for those employees who 
desired mobility. Table 5 shows the results. For the university respondents, 
perceived ease of movement explained 2 percent of the variance in commit- 
ment with other variables controlled; however, perceived ease of movement 
was not significantly related to intention to quit. For the state agency 
respondents, perceived ease of movement explained 2 percent of the vari- 
ance in commitment and 5 percent of the variance in intention to quit, with 
all other variables controlled. 


DISCUSSION 


This study showed that young clerical employees who had short job 
tenure, felt that their personal characteristics matched organizational 
criteria for mobility, and received feedback from their supervisors and 
others about how well they were doing on their jobs perceived the greatest 
opportunities for mobility within their organizations among the employees 
studied. The relationship between age and perceived ease of movement 
reflected reality at the university, which had a salary structure of overlap- 
ping pay scales for different jobs. Older employees with long tenure were 
usually at the top of their pay scales, so if they were promoted they would 
have to be paid at the middle or top of the scale for the next highest grade. 
Thus, departments often hired younger or new employees whom they could 
pay at the low ends of scales. 

The lack of relationship between perceived organizational policy on 
vacancies and perceived ease of movement was surprising. However, the 
respondents who perceived that their organizations filled vacancies exter- 
nally were less committed to their organizations, and the university respon- 
dents having this perception also reported greater intentions to quit. Work 
attitudes may determine perceptions of policy rather than vice versa. Studies 
have shown that individuals tend to attribute failure to factors beyond their 
control but attribute success to their own efforts (Beehr et al., 1980). Therefore, 
if employees are dissatisfied with their own careers and have been unsuc- 
cessfully attempting to transfer to better jobs, they may conclude that their 
organization prefers to fill positions through the external labor market. 

The positive relationship of perceived ease of movement and the match 
between perceptions of organizational criteria for mobility and personal char- 
acteristics provides some evidence that employees assess their opportunities 
for mobility on the basis of whether they meet the criteria they perceive to be 
necessary for promotion; however, the amount of variance explained by the 
scale was not large. More research needs to be conducted on employees’ 
perceptions of criteria for promotion and what factors influence those 
perceptions. 
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The positive relationship between feedback and perceived ease of move- 
ment emphasizes the role supervisors play in the development and career 
satisfaction of subordinates, particularly in organizations where job transfer 
systems are decentralized. In this study, positive feedback was not differenti- 
ated from negative feedback, but the positive relationship between feedback 
and the match between perceived organizational criteria for mobility and 
personal characteristics indicates that employees tend to think of feedback 
in positive terms only. 

The other job characteristics that we suggested would determine visibil- 
ity of individuals to decision makers and visibility of jobs to individuals 
were not related to perceived ease of movement. Gould and Penley (1984) 
found that clerks do not report using the career strategies of networking and 
creating opportunities as frequently as do managers. We do not know whether 
their jobs constrain them from using these strategies. In an organization such 
as the state agency, where mobility is practically nonexistent and likely to 
be lateral rather than vertical, no use of any strategy is likely to lead to suc- 
cess, nor are the presence of certain job characteristics likely to increase op- 
portunity. More research on the relationship between career strategies and 
job characteristics for all occupational groups is needed. 

The stability of results across the two sites lends credence to these 
conclusions. However, only 25 percent of the variance in perceived ease of 
movement was explained. This study limited the hypothesized determi- 
nants of perceived ease of movement to variables on an individual level. 
Future studies should also examine variables at environmental and 
organizational levels. Environmental variables would include economic 
conditions, current levels of business or service activity, and demand for 
particular occupational groups. Organizational variables would include size, 
technology, number of hierarchical levels, staffing procedures, job posting, 
career counseling, and career pathing. Anderson and colleagues (1981) 
hypothesized that “the characteristics of the environment, organization, and 
work force will have an indirect effect on mobility through their primary 
influence on the opportunities or vacancies in the organization" (1981: 532).? 
However, whether individuals consider these factors in assessing their career 
opportunities is open to debate. This study was conducted in a time of high 
unemployment in towns that were almost entirely dominated by single 
organizations. Given these environmental characteristics, employees were 
overly optimistic about intraorganizational opportunities for mobility. One 
possible explanation for these results is that employees who perceived no 
opportunities saw no point in participating in the study. The possibility also 
exists that other occupational groups, such as professionals or upper-level 
managers, may be more attuned to the influence of environmental and organi- 
zational characteristics on career opportunities than lower-level employees. 

The study also showed that employees who desired mobility but per- 
ceived little opportunity to move to other positions were less committed to 


3 For a review of pertinent litérature, see Anderson and colleagues (1981). 
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their organizations and reported greater intentions to quit than employees 
who did perceive themselves as having opportunities. However, controlling 
for other variables, we found that, for the university respondents, perceived 
ease of movement was not related to intention to quit. Outside job opportuni- 
ties were probably greater for state agency employees than for the university 
employees, since the state agency was located in a larger community. Also, 
university employees were more satisfied with their current jobs, which had 
more autonomy, variety, and feedback, than employees at the state agency, 
and many of them had already been promoted. 

These results could have implications for organizations that have career 
mobility systems and procedures similar to the organizations studied. 
First, these organizations may want to identify those lower-level employees 
who desire advancement. Second, if these employees are high performers, 
organizations could encourage their career development and create ladders 
for them that permit advancement. If advancement is not possible, jobs could 
be designed to maximize variety, autonomy, and feedback. Otherwise, in a 
favorable job market, these organizations could lose their best support staff. 

Finally, there is need for research that investigates the meaning of career 
advancement to various groups of employees and the relationships between 
actual opportunities for mobility, perceptions of mobility, and performance 
in organizations. To date, most studies have focused on either aggregate or 
individual levels of analysis. In a recent study, Scholl (1983) investigated the 
relationship between objective career-based variables (aggregate level) and 
turnover and intention to remain (individual level). More studies of this type 
are needed—but two missing links should be added: employees’ interpreta- 
tions of mobility, and their expectations for the future. 
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APPENDIX 


The items and response formats for all scales created specifically for this study appear 
below. Construction of the scale measuring the match between employees' perceptions of 
organizational criteria for mobility and personal characteristics and computations for actual 
mobility measures are also discussed in detail. 

Unless otherwise noted, we used a 7-point response format, anchored 1 = strongly disagree 
and 7 = strongly agree for the items in the scales measuring perceived ease of movement, 
intention to quit, perceived organizational policy or vacancies, and desire for mobility. 


Perceived ease of movement 

My chances for moving above my present position are high. 

It would be easy to find a job in another department. 
The following two items were answered according to a 6-point response format, anchored 1 = I 
have no chances at all, and 6 — I have very high chances: 

In your opinion, what are the chances you have to move to another job at the same level as 

the one you now have, but in a different department? 

In your opinion, what are your chances of moving to a job at a higher level within ? 
The four items for this scale were all standardized to correct for differences in response formats 
and then averaged. 


Perceived organizational policy of filling vacancies from within 
Job vacancies at are usually filled by people from outside E 
A employee who applies for another job at hasa better chance of getting that job 
than someone from outside who applies for the job. 
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Intention to quit 
As soon as I can find a better job, I'll leave . 
I am seriously thinking about quitting my job. 
I am actively looking for a job outside 


Desire for mobility 
I would like a job with more responsibility. 
If I’m not promoted from my present job within three to five years, I will be disappointed. 
I would feel much better about working at if I were promoted. 
I am not interested in moving from my present job. (Reverse scored) 
Mobility 
To compute rate of mobility, we summed frequencies of lateral and upward mobility and 
divided the result by organizational tenure. Frequencies of lateral moves, upward moves, and 
demotions were also computed and divided by organizational tenure. 





Perceived match between organizational criteria for mobility and personal characteristics 
The perceived match scale was created by comparing two dichotomies: whether respon- 
dents perceived each of 18 factors as helping or hindering mobility and whether they had or did 
not have that particular characteristic. The 16 factors were performing well on the job, being a 
woman, being a man, having a high skill level, knowing the right people, being Asian, being 
white, being black, being Indian, being Hispanic, having a great deal of education and training, 
making a personal effort to get a job, being under 25, being between 16 and 35, being between 35 
and 45, and being over 45. Respondents answered these items on a 7-point response format, 
anchored 1 = greatly hinders and 7 = greatly helps. We then dichotomized responses, with a 1, 2, 
or 3 categorized as hindering and a 5, 6, or 7 response categorized as helping movement to 
another job. d 
- These responses were then compared to responses on the following items: gender, race, age, 
level of education, perceived level of job performance, perceived ability to move to another job, 
perceived connections with people who could help respondents move to other jobs, and amount 
of personal effort expended to find a better job. The last four variables were measured on scales 
with two, three, two, and two items respectively, with Likert-type 7-point response formats. 
Those who scored less than 4 were categorized as not having that characteristic, while those 
who scored greater than 4 were categorized as having the characteristic. 
A total of 36 comparisons were made. Sixteen of the comparisons were helping conditions. 
For instance, if being male was perceived as helping mobility, and the respondent was male, the 
match was scored 1. If being male was perceived as helping mobility, but the respondent was 
female, the score was 0. Twenty of the comparisons were hindering conditions: if being female 
was perceived as hindering mobility, and the respondent was female, the match was scored 1. If 
being female was perceived as hindering mobility, but the respondent was male, there was no 
match and the score was 0. We summed the hindering and helping conditions separately, and 
the subtracted the hindering score from the helping score. Therefore, the higher the score the 
greater was the perceived match between organizational criteria for mobility and the personal 
characteristics of a respondent. 
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and Labor Relations, Cornell University; she is an assistant professor of organizational 
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EFFECTS OF AN EXTRINSIC REWARD 
ON INTRINSIC MOTIVATION: 
A FIELD EXPERIMENT 


PAUL C. JORDAN 
University of Alaska-Fairbanks 


In their original research, Deci (1971, 1972a, 1972b, 1975) and his associ- 
ates (Deci, Betly, Kahle, Abrams, & Porac, 1981; Deci, Cascio, & Krussell, 
1975) claimed that expectation of contingent rewards undermines intrinsic 
motivation. There have been more than two dozen studies dealing with the 
effects of contingent rewards on intrinsic motivation. Unfortunately, lack of ` 
a standard terminology has been a major obstacle to integrating these findings. 
A recent study overcame that problem by standardizing a vocabulary and 
applying it to a review of the procedures and findings (Ryan, Mims, & 
Koestner, 1983). The result was a categorization of previous studies that 
revealed two important gaps in this area of research. First, none of the research 
compared effects of rewards that were contingent upon performance to effects 
of rewards that were not contingent upon performance. Second, most previ- 
ous studies were laboratory ones, and no field research has validated their 
results (Pinder, 1976). 

The purpose of the present study was to test Deci's hypothesis with a 
better research design than previous studies used. The design of this study 
incorporated these four features: (1) performance-contingent rewards were 
compared to rewards that were not contingent upon performance; (2) per- 
formance-contingent rewards were offered in combination with regular pay; 
(3) a field setting was used rather than a laboratory setting; and (4) intrinsic ` 
motivation was measured with a scale that has established construct validity. 
This study also explored possible effects of contingent rewards on satisfac- 
tion with pay because previous studies have not examined the effects of 
expected rewards on people's satisfaction prior to their receiving rewards. 

Deci's original work suggested the need to test the following hypotheses: 

Hypotheses 1: The expectation of a reward that is contin- 
gent upon performance will decrease intrinsic motivation. 
Hypothesis 2: The expectation of a reward that is contin- 
gent upon performance will have no effect on satisfaction 
with pay. 


The author wishes to express his appreciation to Dr. William A. Smith, Jr., Department of 
Industrial Engineering, and Dr. Donald D. Drewes, Department of Psychology, North Carolina 
State University, for their assistance in this research. 
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METHODS 
Research Setting 


In 1977 the North Carolina General Assembly enacted Senate Bill 722, 
creating a pilot incentive program for state employees. Pilot programs were 
established in three units of the state government, including the Western 
Carolina Center in Morganton, North Carolina, which provides care for 500 
mentally retarded and physically handicapped children. The Center’s pilot 
program was tested during a one-year period, from July 1, 1978 through June 
30, 1979. 

This incentive program promised 225 employees of the Center equal 
shares of up to 25 percent of all cost reductions at the end of the one-year 
period. The savings for an administrative unit were to be determined by 
calculating the difference between the previous two years’ average costs and 
average costs during the experimental year. Because the Center’s major 
expense was labor, the plan’s instigators anticipated savings from workforce 
reductions through attrition and retirement. Analysis of expenditures revealed 
that a 10 percent decrease in variable expenses was the maximum possible 
cost reduction. If achieved, this reduction would mean a $500 bonus for each 
participant in the plan. Participating employees were not told the amount of 
their potential rewards. 


Design 


This study used the 2 x 2 x 2 design recommended by Winer (1971) for 
experiments witb repeated measures on a single factor. The design included 
two treatment conditions. In the first, rewards were contingent upon perfor- 
mance; in the second, rewards were not contingent upon performance. The 
design also included two work shifts, day and night, and two measurement 
times, one month before and two-and-a-half months after the beginning of the 
incentive program. In order to have as many respondents as possible, random 
assignment of individuals to treatment conditions was not employed. 

All potential subjects were tested in their normal work assignments. 
Preestablished work shifts were preserved so that the study could detect 
differences arising from variations in supervision. i 


Subjects 


The 48 health care technicians at the Center recruited for the program 
provided handicapped and mentally retarded children with personal care, 
including feeding, training, and supervision. Units H and A were selected 
for participation in the program from four administrative units in the Center 
because their patients and tasks were similar. Unit H was selected as the 
group with rewards contingent upon performance because it had the largest 
number of individuals (32). Unit A, with 16 members, was selected as the 
group with rewards that were not contingent upon performance. All subjects 
had at least high school educations, and most were from lower economic 
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strata. No specific training was required to perform their job duties. Table 1 
shows biographical data for the subjects. 


Measures of Dependent Variables 


The study used the Job Diagnostic Survey (JDS) developed by Hackman 
and Oldham (1975) to measure internal work motivation. Fisher (1978) con- 
cluded that this instrument's internal work motivation scale is identical to 
what Deci (1975) defined as intrinsic motivation. 

The Job Descriptive Index (JDI) (Smith, Kendall, & Hulin, 1969) mea- 
sured satisfaction with pay. No measurement was taken to determine whether 
or not subjects expected to receive rewards. i 


Pretest 


The JDI and JDS were administered simultaneously to all employees in 
the two work groups one day prior to the announcement of the incentive 
program. Supervisors requested employees’ participation in the survey but 
emphasized that it was voluntary and that refusal to participate would not 
affect their employment. Employees were told that the purpose of the JDI 
and JDS was to discover how they felt about their jobs, supervisors, 
co-workers, and working conditions so that management could learn where 
improvement was needed. In addition, they were told that all responses 
would be kept confidential and that only group summary scores would be 
published. 

The employees were also asked to complete the biographical data 
requested on the JDS. Survey responses were grouped by unit and shift, and 
the names of those who took the survey were recorded on a separate list to 
ensure that the same individuals would participate in a later survey. 

The following day, the Center’s administrators announced the state’s 
incentive program for Unit H to all employees. They made no mention of the 
survey since it was not part of the program. Employees were told that if the 
pilot program, to begin in one month, was successful it would be imple- 
mented in other units later. 


TABLE 1 
Biographical Data on Subjects 


Age Ranges 
Experimental Groups Gender 20-29 30-39 40-49 50—59 Totals 
Rewards contingent Men 6 3 1 1 11 
on performance Women 6 5 4 6 21 
Rewards not contingent Men 1 1 0 0 2 
on performance Women 1 5 5 3 14 
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Posttest 


A posttest survey was administered 212 months into the incentive period 
and 3% months after the initial survey. Data on cost reductions were not 
available at the time of the posttest so subjects still had no idea of the 
amounts of their potential rewards. Those who had taken the pretest survey 
were asked to take the posttest survey. Survey responses were again marked 
and grouped according to unit anc shift. Pretest and posttest survey responses 
were matched by using biographical data such as age group, gender, and 
educational level; where necessary, matches were based on subjects’ 
handwriting. Random selection of sets of two matched surveys for both 
conditions followed; after selection the group with rewards contingent upon 
performance contained 32 individuals and the group with rewards that were 
not contingent on performance contained 16 individuals.* 


. Analysis 


An analysis of variance (Winer, 1971: 540) using the 2 x 2 design referred 
to earlier in this section was conducted for the measures of intrinsic motiva- 
tion and satisfaction with pay. If extrinsic rewards affected intrinsic motiva- 
tion and satisfaction with pay, the interaction of treatment X time (A x C) 
would be significant. Normality tests were conducted to ensure proper use of 
parametric statistics and a t-test (Winer, 1962) was used to test the simple 
main effects. 


RESULTS 


Analysis of variance showed that only the measure for intrinsic motiva- 
tion had a statistically significant F-ratio, which was 11.486 for the A x C 
(treatment x time) interaction (F444 = 7.25, p < .01). Table 2 shows the 
means and standard deviations for the two dependent variables. 

Figure 1 illustrates two findings. Subjects in both shifts who were in the 
group with rewards contingent on performance experienced decreases in 
intrinsic motivation. But in the groups with rewards that were not contin- 
gent upon performance, individuals on both shifts experienced increases in 
intrinsic motivation. Results for satisfaction with pay were not significant. 
These findings support Hypotheses 1 and 2. 


DISCUSSION 


The results of this experiment provide empirical evidence that, for these 
workers in this setting, expectation of rewards contingent upon performance 
decreased intrinsic motivation. Deci's (1971) theory of cognitive evaluation 


! The pretest was administered to 48 performance contingent and 30 performance non- 
contingent subjects. Attrition and absentegism at the time of the posttest reduced these numbers 
to 42 and 21 respectively. From these matched responses, 32 and 16 were randomly selected to 
provide for the proportionality of cells frequencies needed for the ANOVA. 
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TABLE 2 
Means and Standard Deviations 
for Dependent Variables 








Intrinsic Satisfaction 
Motivation With Pay 
Experimental EE RP ——— 
Times of Surveys Groups Shifts | Means s.d. Means s.d. 
1 month before Rewards not 1 5.30 .350 8.75 8.94 
start of incen- contingent 2 5.04 .704 18.70 67.90 
tive program on performance 
Rewards con- 1 5.60 .492 9.37 29.30 
tingent on 2 6.08 .320 11.10 — 101.00 
performance 
234 months after Rewards not 1 5.81 161 6.50 26.70 
start of incen- contingent 2 5.52 537 14.70 184.00 
tive program on performance 
Rewards con- 1 5.29 879 104.00 121.00 
tingent on 2 5.89 .277 13.10 111.00 
performance 


predicted that this would be the case. Generalizing the results of this experi- 
ment is inappropriate for two reasons. First, assignment of subjects to treat- 
ment conditions was not random. Second, because this was a field experiment, 
subjects were exposed to a wide variety of uncontrolled stimuli. However, if 
future research tests the same factors in other field settings and produces the 
same results, the general applicability of these findings would be strength- 
ened. 

Unlike the control subjects in Deci's experiments (1971, 1972a, 1972b), 
the subjects in the group with rewards that were not contingent upon perfor- 
mance in this experiment were aware of the program offering performance- 
contingent rewards to their co-workers. DeCharms (1968) provided one possi- 
ble explanation for the observed increase in intrinsic motivation for the 
group with rewards that were not contingent upon performance. He theo- 
rized that in the absence of contingent rewards, individuals attribute their 
behavior to internal sources and experience increases in intrinsic motivation. 

Organizational researchers may find other aspects of this experiment 
interesting. For example, the changes in intrinsic motivation occurred in a 
group incentive plan and in a real workplace. Lawler (1973) stated that 
shared incentives may negate the bad side effects he observed in individual 
incentive plans. This study's results indicate that shared incentives do not 
ameliorate the negative effects of performance-contingent rewards on intrin- 

- sic motivations. Pinder (1976) questioned whether Deci's (1971, 1972a, 
1972b) results would occur in real organizational settings, where many 
simultaneously operating reward schedules influence people. Results of this 
experiment indicate that rewards that are contingent upon performance will 
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decrease intrinsic motivation in the presence of other reward schedules that 
may be operating. 

Although the results reported are statistically significant, the magni- 
tude of the change in scores was relatively small. The Likert-type scale used 
to measure intrinsic motivation on the JDS has seven gradations. The average 
change in the different scores was 0.74, or approximately three-quarters of 
one gradation. This means that change in intrinsic motivation was from a 
high level to a moderately high level. The absence of any change in attitude 
toward pay supports DeCharms's (1968) contention that simply contracting 
to engage in an activity that is contingent upon performance reward is suffi- 
cient to undermine instrinsic motivation. 
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INFLUENCE OF TASK INTERDEPENDENCE 
AND NUMBER OF POOR PERFORMERS ON DIAGNOSES 
OF CAUSES OF POOR PERFORMANCE 


KAREN A. BROWN 
Seattle University 
TERENCE R. MITCHELL 
University of Washington 


Two studies investigated how the number of people performing poorly 
in a group and the degree of task interdependence among group members. 
influence supervisors when they diagnose the causes of poor performance. 
We chose to investigate these two factors because of their hypothesized 
relationships to the appraisal and control of performance. Figure 1 presents a 
"model that serves as a framework for describing the hypothesized influence 
of perceived interdependence of tasks and number of poor performers on 
diagnoses concerning performance. The natures of diagnoses influence out- 
comes such as appraisals of performance and corrective actions. For example, 
if supervisors diagnose something in a work environment, such as a delay in 
receiving materials, as causing poor. performance, they are likely to direct 
corrective actions toward the work system; for instance, inventory control 
. might be improved. Additionally, the performance ratings of employees 
should suffer no adverse effects under such circumstances. On the other 
hand, if supervisors see employee-centered factors such as motivation or 
ability as sources of poor performance, they will lower employees’ perfor- 
mance ratings and will direct corrective actions toward employees. Improv- 
ing feedback would be one such action. A research question arises: What 
influences supervisors to make,employee-centered diagnoses, and what influ- 
ences them to make diagnoses that center on work systems? This research 
focused on two major variables that we hypothesized to influence the direc- 
tion of diagnoses of the causes of poor performance. 


Support for this research was provided in part by National Science Foundation Grant DAR 
79-09792. The authors thank L. R. Beach, T. D. Klastorin, W. T. Newell, and T. G. Schmitt for 
their comments on the research design. They also wish to thank Bud Borders for his assistance. 
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Task Interdependence 


A number of authors have studied task interdependence and defined it . 
differently (McCann & Ferry, 1979). However, most authors seem to agree 
that it can have a significant effect on employees' productivity and the perfor- 
mance of work systems (Cheng, 1983; Dyer, Millen, & Morse, 1978), as well 
as on employees’ morale (Kiggundu, 1983)..If task interdependence influ- 
ences performance, it should be considered in appraisals of performance and 
in diagnoses of problems with performance (Kane & Lawler, 1979; Peters & 
Rudolph, 1983; Mitchell & Liden, 1982). 

In this research, we expected to find that supervisors make external 
attributions, diagnosing work environments as causing poor performance, 
when task interdependence is high. We expected them to make internal attri- 
butions, in which they diagnose characteristics of employees as causing poor 
performance, when task interdependence is low. 


Number of Poor Performers 


It is useful to consider the extent to which supervisors are sensitive to 
information about the numbers of people performing poorly in groups and 
thé extent to which they use it appropriately. The more people who are 
performing poorly, the higher the probability that something common to all 
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employees is to be blamed.! Tasks and settings are more likely than employees 
to be universal forces affecting all members of a group. Thus, we anticipated 
that supervisors would make internal diagnoses or attributions when only 
one person performed poorly and external diagnoses when several people 
performed poorly. 


Additional Relationships 


Our model also links number of poor performers and perceived task 
interdependence. Characteristics of work environments provide supervisors 
with baseline impressions of task interdependence. However, when perfor- 
mance problems occur involving several individuals, supervisors may 
increase their estimations of the extent to which tasks are interdependent in 
their groups. Over time, supervisors will continue to modify baseline impres- 
sions of task interdependence through observations about the number of 
people who typically perform poorly. 

It is also clear that supervisors will not always make rational decisions 
as they progress through such diagnostic sequences. For example, Mitchell, 
Green, and Wood (1981) argued that supervisors have a bias toward finding 
fault with employees rather than with tasks or environments. Previous 
research (Jones, 1979) has demonstrated this error in attribution and we ex- 
pected to obtain similar results. The consequence of a biased diagnosis will 
be an inappropriate corrective action or an inaccurate performance appraisal, 
either one of which can detract from future performance levels. 


METHODS: LABORATORY STUDY 


A laboratory study was designed to provide a controlled test of two 
variables, task interdependence and number of poor performers. Realistic 
stimulus materials depicted task interdependence as either high or low and 
depicted either a group or an individual as poor performers. The study had a 
2 X 2 factorial design for the independent variables, with diagnoses of causes 
of poor performance serving as dependent variables. 


. Subjects 


Subjects were 127 M.B.A. students enrolled in introductory courses on 
operations management. They provided a relatively homogeneous popula- 
tion whose members had at least some work experience (average years of 
work experience = 4.6). 


1 Kelley's (1967) work on attributions suggested that information about the number of group 
members performing poorly (called consensus information) is one of a set of performance 
comparisons that can be made. Comparisons of performance levels for an individual over time 
(called consistency information) and comparisons of an individual's performance across tasks 
(called distinctiveness information) also are important. However, Brown (1984) argued that 
consensus information is the most readily available and accurately assessed type of diagnostic 
information in work environments. 
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Procedures 


Subjects were provided with a two-page case describing a medium-sized 
electronics firm. When the subjects finished reading the case, the two inde- 
pendent variables were manipulated through information about a particular 
job. The result was a 2 x 2 design in which task interdependence and number 
of poor performers each had two conditions, high and low. We randomly 
assigned course sections to one of the four treatments. 

Next, subjects in each treatment saw a series of slides and heard a tape 
that described a job assembling circuit boards. These materials had been 
constructed from information gathered at a major electronics manufacturing 
firm. Some of the slides showed diagrams of the flows of material and infor- 
. mation for the circuit-board-assembly work center, and others were photo- 
graphs taken on the shopfloor of the electronics firm. In the accompanying 
audio tape, an employee—actually a research assistant— described his job in 
the work center. The slides and description of the task were identical in all 
conditions except for two manipulations. Half of the subjects received infor- 
mation on work flows that indicated high task interdependence and the 
other half received information indicating low task interdependence. Figure 
2 shows how flows of materials and information were depicted for both 
conditions. As this figure reveals, task interdependence was both initiated 
'and received because it flowed in two directions (Kiggundu, 1983). 

Next, subjects reviewed information about a performance problem. An 
individual employed in the work center—but not the person who had described 
the job on the tape—was depicted as having produced at 40 percent below 
expected levels of output during the previous week. A slide illustrated the 
decline in performance graphically; then a second slide provided compara- 
tive information about other employees in the workgroup. Half of the sub- 
jects saw a slide revealing that the focal employee was the only one experi- 
encing a decline in output, and half saw a slide revealing that other group 
members were experiencing similar declines. This procedure manipulated 
the variable for number of poor performers. Figure 3 shows the diagrams 
used in this manipulation.’ 

After their exposure to the two manipulations, subjects received a ques- 
tionnaire designed to assess their perceptions of the causes of the individual’s 
performance problem. Subjects rated nine items? on how likely each was to 


2 As a result, the manipulation actually blended consensus information with what Kelley 
(1967) termed consistency information. However, we considered this mix of the two types of 
information necessary for creating realistic performance charts. Also, for the focal employee 
who was being rated, the consistency information was exactly the same in both conditions. 

3 The nine items used were: To what degree do you feel that something about the employee 
(e.g., his effort or ability) is likely to have caused the performance problem; that something in the 
work environment or something about the job (e.g., problems with materials or information) 
may have caused the performance problem; that the employee's performance problem may have 
been caused by personal problems; that a delay in the arrival of incoming work materials may 
have contributed to the employee's problem; that the quality of incoming work materials may 


(Continued) 
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have caused the individual’s performance to decline. Responses were on 
5-point scales. Other items provided a manipulation check by requiring sub- 
jects to rate the extent to which they felt employees in the work center 
depended on (1) work materials and (2) information to perform their jobs. 


RESULTS: LABORATORY STUDY 


The manipulation checks were used to test the strength of the manipula- 
tion of perceived task interdependence. The mean of the two scaled items 
produced an index of perceived task interdependence. Results of a 2 x 2 
ANOVA on this index showed a main effect for task interdependence (F, 123 
= 10.93, p<.001) and no main effect for number of poor performers. This 
verified that the task interdependence manipulation had been effective. There 
was also an interaction effect (F, 423 = 5.55, p<.023). Specifically, subjects 
. revised their perceptions of task interdependence upward when confronted 
with a performance problem involving several people. This result supported 
the hypothesized moderating effect of information about number of poor 
perfarmers on perceived task interdependence. 


Testing Hypotheses 


A two-way multivariate analysis of variance between subjects was per- 
formed on scaled responses to questions about the causes of the decline in 
performance. The analysis revealed significant effects for task interdepend- 
ence (Fg 442 = 3.197, p<.003); number of poor performers (Fg 4, — 117.69, 
p<.001); and the interaction term (Fg 4, —2.58, p<.013). 

As an aid to the interpretation of the significant MANOVA results, we 
calculated mean scores on items frcm the questionnaire to create two indices. 
One index was based on responses to the two items suggesting employee- 
centered or internal causes (r= .38, p<.001). The other was based on responses 
to the seven items suggesting external causes related to the work system 
(a 2.776). The mean scores for t3ese two indices were not significantly 
different, demonstrating that the anticipated bias toward employee-centered 
diagnoses did not occur. We next subjected the two indices to a two-way 
analysis of variance. 

The F-statistics for the cell means shown in Table 1 reveal that both 
number of poor performers (F4 4560.45, p«.001), and task interdepend- 
ence (F, 123 76.34, p<.013) influenced diagnoses that centered on employees. 
The effects were in the hypothesized direction: the individual employee was 
more likely to be blamed if there was only one poor performer or if task 


have contributed to the employee's problem; that problems encountered after completed circuit 
boards leave the work center (e.g., inspection errors or breakage) may have contributed to the 
employee's problem; that delays in the arrival of work-related information may have contrib- 
uted to the employee's problem; that prob'ems with the accuracy or quality of incoming work 
information may have contributed to the employee's problem; that problems encountered after 
information leaves the work center (e.g., keypunching errors or lost records) may have contrib- 
uted to the employee's problem? 


1986 Brown and Mitchell 


FIGURE 2 
Task Interdependence Resource Flows: Laboratory Study 
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TABLE 1 
Mean Ratings for Cells: Laboratory Study 





toa 


b 


Task Interdependence” 
Number of Poor Performers® Low High Overall 
(a) Index of Employee-Related Causes (Internal) 
One 3.59 3.03 3.32 
: , n=33 n=31 n=64 
Several 2.32 2.17 2.27 
n=37 n=26 n=63 
Overall 2.92 2.65 
n=70 n=57 
(b) Index of System-Centered Causes (External) 
` One 2.11 2.72 2.41 
n=33 n=31 n=64 
Several .3.10 : 3.03 3.07 
n=37 n=26 n=63 
Overall 2.63 2.86 
n=70 n=57 , 
TE, 423 = 60.45, p<.001 for internal causes; P, ,.,= 42.48, p<.001 for external causes. 


11237 9.394, p<.013 for internal causes; F, ,,,— 7.22, p<.003 for external causes. 


interdependence was low. There was no interaction effect for the employee- 
centered causes. 

Both independent variables also significantly affected system-related 
diagnoses (number of poor performers, F4 123 = 42.48, p<.001; task interde- 
pendence, F, 123 = 7.22, p<.008). These results were also in the hypothesized 
direction. Causes related to the work system (external attributions) received 
their highest ratings when there were several poor performers or when task 
interdependence was high. 

In addition to the two main effects, there was a significant interaction 
effect (F4 123 — 10.34, p<.002) on the index of system-related diagnoses. The 
interaction term shows that task interdependence influenced attributions of 
poor performance for the individual more than it influenced attributions of 
poor performance for the group. This suggests that when several people are 
performing poorly, information about task interdependence contributes lit- 
tle to diagnoses of the causes of the poor performance because information 
about the number of poor performers has already pointed to the work systein 
as the cause of the problem. 


FIELD STUDY 


A field study was conducted to assess the extent to which task interde- 
pendence and information about number of poor performers influence 
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supervisors’ diagnostic judgments in actual work settings. Additionally, it 
provided an opportunity for exploring manufacturing supervisors’ opinions 
about the causes of declining output. 


Respondents 


The field study involved one-hour interviews with supervisors and 
managers in 12 manufacturing firms. We chose this group from the respon- 
dents to a mail survey; the firms represented a variety of industries, includ- 
ing producers of electronics equipment, garments, ships, industrial equip- 
ment, wood products, graphics, and industrial plating. 

From one to six people were interviewed in each firm, depending on its 
size and on the availability of personnel for participation in the study. In 
total, we interviewed 46 people. 


Measures 


In each firm, a specific semiskilled task was selected as the focus for all 
responses. During the first part of each interview, respondents rated the 
degree of task interdependence associated with the focal tasks. They rated 
interdependence between and within work units with regard to materials, 
machinery, scheduling information, specialized manpower, and teamwork. 
All scales had 5-point response formats. 

Next were scaled items designed to tap respondents’ personal Ges 
regarding the causes of poor performance. Two scenarios were described: 
one in which an individual was performing a focal task at 20 percent below . 
the expected level of output, and one in which an entire work group was 
performing the task at 20 percent below the expected level. 

Respondents rated eight causal factors as to the likelihood of their hav- 
ing contributed to each of the two. performance problems. We adapted these 
causal factors from Buffa’s (1976) description of assignable causes and from 
Deming’s (1975) listing of the common causes of variation in performance. 
Using a 5-point response format, respondents rated each factor as (1) a cause 
of the individual's poor performance and (2) a cause of the group's poor 

. performance. 


RESULTS: FIELD STUDY 


The mean ratings for the eight causal factors were, in order of magnitude: 
motivation, 3.22; quality of materials, 2.95; training, 2.94; ability, 2.58; delays 
of materials, 2.58; performance of machines or equipment, 2.51; capacity of 
machines, 2.50; scheduling, 2.23. Thus, respondents saw motivation as the 
most likely cause of poor performance. Aggregations of the employee-centered 
causes (motivation, training, ability) and the causes centered on work sys- 
tems (delays in materials, quality of materials, performance of machines, 
capacity of machines, and scheduling) showed that respondents saw the 
employee-centered factors (x — 2.91) as significantly more important than the 
work-system related factors (X= 2.50; F4 45 = 7.8; p<.01). As predicted, manag- 
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ers and supervisors saw employees, rather than environment, as the cause of 
performance problems. 


Testing Hypotheses 


Respondents were separated into two groups on the basis of their mean 
responses on the ten items measuring interdependence (a —.743). We made 
the split at the median. The aggregated measure used to split the two groups 
(X=3.45 and 4.44) was significantly higher in one group than in the other 
(F4 447 83.97, p<.001). We performed a multivariate analysis of variance on 
responses to the eight scaled items about the causes of declining output. The 
model was based on 2 X 2 design in which task interdependence was high or 
low, and number of poor performers was all or one. Task interdependence 
was a categorical variable and number of poor performers was a manipulated 
variable. Multivariate effects for task interdependence were not significant. 
There were no interaction effects. However, the multivariate result for infor- 
mation about number of poor performers was significant (Fg 3, = 8.02, p<.001). 

A simplified analysis based on aggregated indices of employee-centered 
and system-related causes was performed to examine the general effects of 
the two independent variables. We entered the indexed variables into a 2 x 2 
ANOVA model as dependent variables. Table 2 reports means for cells. The 
index for employee-centered causes (a —.60) was influenced significantly by 
information about the number of poor performers (F, ,, —48.28, p<.001), 
but not by task interdependence. Information about the number of poor 
performers (F4 44= 18.2, p«.001) influenced the index for system-related 
causes (a = .76), and task interdependence had a near significant effect 
(F4 4473.6, p<.07). Neither index resulted in an interaction effect. 


DISCUSSION 


To summarize the results of the two studies, number of poor performers 
affected diagnoses of reasons for poor performance significantly in both the 
laboratory and field studies. Task interdependence had its only significant 
effect in the laboratory study in which graphic displays and job descriptions 
made its presence clear. The lack of a significant effect for task interdepend- 
ence in the field has several possible explanations. First, supervisors may 
not pay enough attention to the variable. Additionally, there may be a defen- 
sive element to this lack of acknowledgment. Supervisors may view them- 
selves as responsible for moderating the effects of task interdependence, but 
avoid including it in diagnoses because doing so would point some blame at 
them (Jones, 1979). 

Also, task interdependence was not manipulated in the field study, and 
differences between the high and low groups may not have been sufficient to 
produce differences in diagnoses. The range of interdependence may have 
been restricted because all respondents focused on semiskilled tasks that 
had similar characteristics. 

It is difficult to determine the true cause of the weakness of the effects 
for interdependence in the field study. However, the methodology and results 
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' TABLE 2 
Mean Ratings for Cells: Field Study 





, Perceived Task Interdependence? 
Number of Poor Performers? Low High Overall 








(a). Index of Employee-Related Causes (Internal) l 
One 3.25 3.48 3.36 
n=23 n=22 n=46 
Several ETS l 2.51 2.45 
n=23 n=23 n=46 
Dverall 2.82 3.00 
n= 23". n=23* 
(b) Index of System-Related Causes (External) 
One 2.17 2.31 2.24 
n=23 ` n-32 n=46 
Several | . 2.54 3.00 ` 2.77. 
. i n=23 n=23 n=46 
Overall 2.30 | 2.71 
n= 23* n=23* 





S F, 14 7 44.28, p<.001 for internal causes; P, = 18.2, p<.001 for external causes. 


b Non-significant for internal causes; Paar 3.6, p<.07 for internal causes. 
*Number of poor performers was a within-subjects manipulation, so the overall n is the same 
as each of the cell n’s. 


of the laboratory study demonstrated that when task interdependence is 
clearly described and subjects have no responsibility for buffering its effects 
on employees’ performance, they will use it in diagnoses of causes of poor 
performance. 


CONCLUSIONS 


This- sach makes some contributions and suggests a need for further 
study.: We -demonstrated both in the laboratory and the field, studies that 
information about the number of poor performers has a significant effect on 
diagnoses of the causes of poor performance. Previous research has demon- 
strated these effects in educational settings (MacArthur, 1971), but they have 
not been demonstrated previously with respect to performance diagnoses in 
work settings. The interdependence effects that emerged in the laboratory. 
also offer a new contribution in the area of performance diagnosis. 

Another contribution comes in the form of a nonsignificant finding. it. 
was a surprise that interdependence did not affect the supervisors' percep- 
tions of the causes of performance problems in the field study. Although we 
gave some possible post hoc explanations for this finding, we still do not 
-have sufficient evidence to choose among these alternatives. Subsequent 
work is needed in this area. 
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In addition to the finding related to the effects of the two information 
sources, the data from the field study showed a general tendency toward 
internal attributions. This confirms results from other studies that have dem- 
onstrated observers’ propensities to blame people rather than environments 
for performance problems. One possible implication is that improvement in 
productivity will come through a focus on changing employees’ abilities, 
levels of motivation, attitudes, and so forth. However, previous research 
provides some evidence that increased awareness of social, task, situational, 
technological, and environmental limitations on performance is needed 
(Peters, O'Connor, & Rudolph, 1980). 

It also appears that supervisors may be reluctant to use information on 
task interdependence. Until organizations remedy this problem, perhaps 
through training, it is likely that diagnoses of the causes of poor performance 
will continue to focus on individuals and possibly continue to be inaccurate 
or incomplete. This research highlights this point by demonstrating that 
only when information on task interdependence was made highly salient in 
the laboratory study did it have a noticeable effect on diagnosis. Future work 
needs to focus not only on what information people use in making diagnoses, 
but also what information they should use. 
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ADDITIVE INFORMATION PROCESSING 
AND THE RELATIONSHIP BETWEEN EXPECTANCY 
OF SUCCESS AND MOTIVATIONAL FORCE 


ADRIAN HARRELL 
University of South Carolina 
MICHAEL STAHL 
Clemson University 


In the force model posited by expectancy theory (Vroom, 1964), the level 
of motivational force acting on a person is determined by multiplying the 
model's valence and expectancy variables. Logically, this implies that 
increases in expectancy result in proportionate increases in motivation, 
However, some recent research (Kahneman & Tversky, 1979; Rynes & Lawler, 
1983) has suggested that increases in expectancy may result in dispro- 
portionately small increases in motivation. In order to consider this issue, 
we employed data from five groups to examine the proposal that increases in 
the expectancy of success result in declining marginal increases in motiva- 
tional force. 

For more than two decades, expectancy theory (Vroom, 1964) has been 
one of the more popular theories of motivation (Connolly, 1976; Heneman & 
Schwab, 1972; House, Shapiro, & Wahba, 1974; Locke, 1975; Mitchell, 1974, 
1979; Mitchell & Beach, 1977; Schwab, Olian-Gottlieb, & Heneman, 1979; 
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Stahl & Harrell, 1981, 1983; Wahba & House, 1974), with most research 
efforts focusing on the force model. Similarly, the expected utility model 
has long been popular among economists and researchers concerned with 
normative decision theory (Keeney & Raiffa, 1976; Samuelson, 1967) for 
explaining human behavior. A number of researchers have noted the close 
correspondence between these two models (e. g., Harrell & Stahl, 1984; Mitch- 
ell & Beach, 1977; Vroom, 1964: 19). The expectancy variable in the force 
model and the expectation variable in the expected utility model both refer 
to the probability that a choice or act will lead to an outcome. Valence and 
utility both refer to affective orientations toward outcomes (Samuelson, 1967: 
417—419; Vroom, 1964: 15). In addition, both models imply that individuals 
employ multiplicative information processing. The models’ premise is that 
people multiply valence (utility) by expectancy (expectation) to determine 
levels of motivation (expected utility); this implies that increases in expec- 
tancy (expectation) should result in proportionate increases in motivation 
(expected utility). However, investigators have criticized both the expec- 
tancy force model and the expected utility model. Methodological and mea- 
surement difficulties experienced in early research on expectancy theory, 
such as the problems discussed by Kopelman (1977), Schmidt (1973), and 
Wolf and Connolly (1981), may have resulted in some decline in the force 
model’s popularity. Similarly, the descriptive validity of the expected utility 
model has been questioned over the years (e. g., Allais, 1953; Coombs, 1975; 
Elisberg, 1961), with recent criticism from Kahneman and Tversky (1979). 
The close correspondence of these models suggests that criticisms of one 
model logically also apply to the other. 

‘Mitchell and Beach (1977) and Zedeck (1977) suggested that behavioral 
decision theory could be used to avoid the problems experienced by earlier 
expectancy theory researchers. Acting on these suggestions, in two previous 
studies (Harrell & Stahl, 1984; Stahl & Harrell, 1981) we used judgment 
analysis to examine the premise of multiplicative information processing 
implicit in the expectancy force model. We concluded that most of the 
individuals studied employed additive information processing to arrive at 
their decisions concerning motivation, with a minority employing the multi- 
plicative information processing assumed by the force model. Accordingly, 
we proposed the force model be modified to acknowledge these individual 
differences in information processing (Stahl & Harrell, 1981). Rynes and 
Lawler (1983), replicating and extending our (1981) research, employed sev- 
eral different analysis techniques and also concluded that only a few of their 
participants employed multiplicative information processing. Butler and 
Womer (1985), employing an analysis technique different from those that 
either we or Rynes and Lawler used, also concluded that a minority of their 
participants employed the multiplicative information processing assumed 
by the force model. We suggested that our (1981) results implied that individu- 
als with high levels of education in mathematics and statistics appear more 
likely to employ multiplicative information processing than are others. But- 
ler and Womer (1985) tested this proposal and concluded that people's facil- 
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ity with mathematics could be used to predict their use of either multiplica- 
tive or additive information processing to arrive at decisions concerning 
motivation. Since both the expectar.cy force model and the expected utility 
model imply that individuals employ multiplicative information processing, 
these results logically apply to botk models. 

Kahneman and Tversky (1979), after presenting criticisms of the expected 
utility model that also logically apply to the expectancy force model, pro- 
posed an extension of expected utility theory called prospect theory. Pros- 
pect theory incorporates many. of the empirical findings presented in the 
literature on behavioral decision theory (Slovic, Fishhoff, & Lichtenstein, 
1977; Slovic & Lichtenstein, 1971). Prospect theory assumes two phases, an 
initial editing phase and an evaluation phase. During the editing phase, 
people eliminate some alternatives and frame acts, outcomes, and contin- 
gencies. During the evaluation phese, they evaluate prospective outcomes 
and make choices. The prospect theory model, which has nót yet been empiri- 
cally tested, is intended to describe how individuals make choices during 
the evaluation phase. As with the expectancy force model and the expected 
utility model, the prospect theory model assumes that individuals employ 
multiplicative information processing when they evaluate prospective 
outcomes. Á principal difference between the prospect theory model and the 
traditional expected utility model is that prospect theory assumes utility to 
be multiplied by a weight that is'a nonlinear function of expectation, rather 
than by the actual value of the expectation variable. Kahneman and Tversky 
presented a hypothetical weighting function (1979: 283) that suggests indi- 
viduals tend to overweight low prooabilitiesand underweight moderate and 
high probabilities. They also presented an S-shaped value function that is 
convex for negative nonzero outcomes and concave for positive nonzero 
outcomes (1979: 279). Since they hypothesized that the part of the value 
function relating to positive nonzerc outcomes! is concave, increases in expec- 
tation should result in choices that imply declining marginal increases in 
expected utility. On the basis of findings presented in their research on 
expectancy theory, Rynes and Lawler (1983) proposed similar relationships 
between the expectancy of success and motivational force. 


METHODS 


This study employed data from five diverse groups to examine the pro- 
posal that increases in the expectancy of success result in declining marginal 
increases in motivation. Although the focus of this research was upon the 
expectancy force model, the results logically apply to the expected utility 
model and in some instances to the prospect theory model as well. We 
previously used the data employed in this study to examine (1) the expec- 
tancy force model's premise of multiplicative information processing, and 


*The upper right quadrant of Figure 1 in Tversky and Kahneman (1981: 454) depicts this 
part of the value function. 
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(2) the force model's ability to explain the decisions reached by individuals 
(Harrell & Stahl, 1984; Stahl & Harrell, 1981). Those studies did not consider 
the relationship between expectancy of success and motivational force, the 
issue of primary interest here. 

Our previous research described how the judgment analysis approach 
that those researching behavioral decision theory have employed can be 
used to perform research on expectancy theory. Thus, we present only a brief 
description of this process. With the judgment analysis approach, each par- 
ticipant must complete a series of decision making cases, in which the ele- 
ments of each case are designed to correspond to the various elements of the 
valence and force models of expectancy theory. Factorial designs are incorpo- 
rated into the information presented in the cases. Initially, each case pre- 
sents a number of second-level outcomes and their relationships to the first- 
level outcome of interest. On the basis of this information, each subject 
reaches a decision as to the attractiveness (valence) of the first-level outcome. 
This decision provides an operational measure of the valence of the first- 
level outcome. After the expectancy of success is presented, the subject then 
reaches an effort-level decision that provides an operational measure of the 
motivational force acting upon that individual under the particular circum- 
stances of the case. Data are analyzed separately for each individual. Usually, 
regression analysis is employed for the valence model, with the beta weight 
associated with each second-level outcome indicating its valence (Stahl & ` 
Harrell, 1983). Hierarchical regression analysis (Arnold & Evans, 1979; Cohen, 
1978; Cohen & Cohen, 1975; Harrell & Stahl, 1984; Stahl & Harrell, 1981) is 
usually employed to examine the force model's multiplicative premise. 

Five groups of subjects completed four exercises in decision making, 
with different groups doing different exercises. The exercises provided oper- 
ational measurements of (1) motivation to get a particular job (groups 1 and 
2), (2) motivation to accomplish budgeted cost goals (group 3), and (3) moti- 
vation to earn an “A” in an academic course (groups 4 and 5). Our earlier 
studies provide detailed descriptions of the subjects and the decision mak- 
ing exercises (Harrell & Stahl, 1984; Stahl & Harrell, 1981). The situations 
presented in all four exercises involved only positive nonzero outcomes and 
therefore related only to the concave portion of the prospect theory value 
function (Kahneman & Tversky, 1979: 289; Tversky & Kahneman, 1981: 454). 


RESULTS 


The results of the data analysis are presented separately for each of the 
five groups. 

In group 1, 59 men who were seniors in accounting at a southwestern 
university completed a decision making exercise, with each subject's effort- 
level decision values providing operational measurements of his motivation 
to get a particular job. Each subject provided eight decision values for each of 
three expectancy levels (0.00, 0.30, and 0.60). Since a factorial design was 
incorporated into the exercise, the information upon which the three sets of 
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eight decisions were based differed only as to the expectancy levels involved. 
Therefore, if increases in expectancy (expectation) result in declining mar- 
ginal increases in the value of motivational force, the differences between 
each subject's corresponding decision values when expectancy equaled 0.30 
and when expectancy equaled 0.00 will be greater than the differences 
between each subject's corresponding decision values when expectancy 
equaled 0.60 and when it equaled 0 30. In this and in subsequent analyses, 
we used a paired-samples t-test to exemine this hypothesis.? The results (df = 
471, t = 4.6, p < .0001) support the proposal that increases in expectancy 
result in declining marginal increas2s in motivation. 

In group two, 75 junior and senior administrative management students 
at a southeastern university completed a decision making exercise, with 
each subject's effort-level decision values providing operational measure- 
ments of the person's motivation to get a particular job. Each subject pro- 
vided eight decision values for each of three expectancy levels (0.05, 0.50, 
and 0.95). If, as proposed, increases in expectancy result in declining mar- 
ginal increases in motivational force, the differences between each subject's 
corresponding decision values when expectancy equaled 0.50 and when it 
equaled 0.05 will be greater than the differences between each subject's 
corresponding decision values when expectancy equaled 0.95 and when it 
equaled 0.50. The results (df — 591,t — 4.4, p « .0001) support the proposal 
that increases in expectancy result in declining marginal increases in 
motivation. 

In group 3, 26 working managers in a southeastern city completed a 
decision making exercise, with each subject’s effort-level decision values 
providing operational measuremerts of the person's motivation to accom- 
plish various projects within budgeted cost goals. Each subject provided 
eight decision values for each of three expectancy levels (0.10, 0.50, and 
0.90). If, as proposed, increases in expectancy (expectation) result in declin- 
ing marginal increases in motivational force, the differences between each 
subject's corresponding decision values when expectancy equaled 0.50 and 
when expectancy equaled 0.10 will be greater than the differences between 
each subject's corresponding decision values when expectancy equaled 0.90 
and when it equaled 0.50. The results (df = 207,t = 4.1, p < .001) support the 
proposal that increases in expectancy result in declining marginal increases 
in motivation. 

In group 4, 48 graduate students in engineering, management, or opera- 
tions research at a midwestern university completed a decision making 


?An alternative approach for analyzirg these data is to compute the average difference 
between the eight decisions each subject reached when expectancy equaled 0.30 and when 
expectancy equaled 0.00 and the average difference between the eight decisions each subject 
reached when expectancy equaled 0.30 and when expectancy equaled 0.60. The paired-samples 
t-test may then be used to compare these av2rage differences. The study results presented in this 
paper are statistically significant if this alternative analysis approach is used. 
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exercise, with each subject’s effort-level decision values providing opera- 
tional measurements of the person's motivation to earn an "A" in an aca- 
demic course. Each subject provided eight decision values for each of three 
expectancy levels (0.00, 0.40, and 0.80). If, as proposed, increases in expec- 
tancy result in declining marginal increases in motivational force, the differ- 
ences between each subject's corresponding decision values when expec- 
tancy equaled 0.40 and when it equaled 0.00 will be greater than the 
differences between each subject's corresponding decision values when expec- 
tancy equaled 0.80 and when expectancy equaled 0.40. The results (df — 383, 
t = 5.2, p < .0001) support the proposal that increases in expectancy result in 
declining marginal increases in motivation. 

In group 5, 20 graduate students in management from the same midwest- 
ern university as group 4 came from completed the same decision making 
exercise, with each subject's effort-level decision values providing opera- 
tional measurements of that person's motivation to earn an “A” in an aca- 
demic course. Each person provided eight decision values for each of three 
expectancy levels (0.00, 0.20, and 0.60), with two of these expectancy levels 
differing from those of group 4. If, as proposed, increases in expectancy 
result in declining marginal increases in motivation, the differences between 
each subject's corresponding decision values when expectancy equaled 0.20 
and when expectancy equaled 0.00 will be greater than one-half the differ- 
ences between each subject's corresponding decision values when expec- 
tancy equaled 0.60 and when expectancy equaled 0.20. The results (df — 159, 
t = 2.1, p < .02) support the proposal that increases in expectancy result in 
declining marginal increases in motivation. Figure 1 summarizes the results 
for all five groups. 


DISCUSSION 


In a number of instances, the subjects’ decisions differed from those 
implied by the expectancy force model, the expected utility model, or the 
prospect theory model. Since all three of these models contain the premise 
that individuals employ multiplicative information processing to arrive at 
their decisions, all three models imply that only a very small level of motiva- 
tion should exist when the expectancy of success is nearly zero. They further 
imply that no motivation should exist when the expectancy of success is 
zero. However, as Figure 1 shows, many of the subjects’ decisions indicated 
that a significant level of motivation existed when the expectancy of success 
was quite small (0.05 and 0.10) and that some positive level of motivation 
existed even when the expectancy of success was zero. 

Previous research provides a possible explanation of these findings 
(Harrell & Stahl, 1984; Stahl & Harrell, 1981}. Results of that research indi- 
cate that most of the subjects who provided our data employed additive 
information processing to arrive at their decisions, rather than the multiplica- 
tive information processing implied by the expectancy force model, the 
expected utility model, and the prospect theory model. If individuals employ 
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FIGURE 1 
Expectancy and Motivational Force Relationships 
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additive information processing to arrive at decisions concerning motivation, 
they should be expected to experience some level of motivation even if 
expectancy is very small or even zero when the valence of an outcome is 
large (Stahl & Harrell, 1981: 321). Since additive linear models usually explain 
nearly all of the variance in the decisions reached by most individuals even 
when their actual information processing is multiplicative (Harrell & Stahl, 
1984; Stahl & Harrell, 1981; Slovic, Fishhoff, & Lichtenstein, 1977; Slovic & 
Lichtenstein, 1971), it seems possible the findings reported in this paper are 
typical of most people. Using this rationale, we might explain the empirical 
results provided by Kahneman and Tversky (1979: 265—273), who appar- 
ently assumed that their subjects employed the multiplicative information 
processing implied by the prospect theory model. If most of their subjects 
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actually employed additive information processing and associated a large 
valence or utility with certain outcomes, we would expect them to have 
reached choices that neither the multiplicative expected utility model nor 
the multiplicative expectancy force model predict. The research of Matsui, 
Okada, and Mizuguchi (1981) appears to support this potential explanation 
for such behavior. They concluded that levels of motivation are, in some 
instances, more strongly influenced by the valence of an outcome than by 
the outcome’s associated expectancy. 

The data provided by all five groups of participants consistently support 
the proposal that increases in the expectancy of success result in declining 
marginal increases in individuals’ levels of motivation. The study’s results 
therefore provide straightforward support for Rynes and Lawler’s (1983) 
suggestion about these relationships. Since the cases in the exercises used to 
gather the data involved positive nonzero outcomes, the results also provide 
support for the general relationships implied by the concave portion of 
Kahneman and Tversky’s (1979) prospect theory value function. 


CONCLUSIONS 


To summarize, recent research on expectancy theory (Butler & Womer, 
1985; Harrell & Stahl, 1984; Rynes & Lawler, 1983; Stahl & Harrell, 1981) has 
indicated that many, perhaps most, people employ additive information 
processing to arrive at decisions concerning motivation rather than the multi- 
plicative information processing implied by the expectancy force model. 
This finding may account for the results presented in this study indicating 
that (1) many individuals exhibit a significant level of motivation even when 
their expectancy of success is very small or zero; and (2) increases in the 
expectancy of success result in declining marginal increases in motivation 
rather than in the proportionate increases implied by the expectancy force 
model. 

Given these findings, a question might arise as to whether the expec- 
tancy force model, as it is commonly formulated and applied in motivation 
theory, is still useful. We believe that it is still useful as a conceptual model 
for explaining the factors that motivate individual behavior. For example, 
we believe it is correct to state that an individual’s level of motivation is a 
function of the force model’s valence and expectancy variables (Vroom, 
1964), However, we believe the explanatory and predictive ability of the 
expectancy theory models would be substantially improved by integrating 
the empirical findings presented in the literature on behavioral decision 
theory into expectancy theory. Accordingly, we propose that some assump- 
tions of prospect theory (Kahneman & Tversky, 1979; Tversky & Kahneman, 
1981) be integrated into expectancy theory. These assumptions are that indi- 
viduals (1) initially perform an editing activity during which they eliminate 
some alternatives and frame acts, outcomes, and contingencies, and (2) subse- 
quently perform an evaluation activity during which they evaluate the remain- 
ing alternatives. We further propose that the expectancy valence model can 
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effectively explain or predict the valence individuals associate with first- 
level outcomes after they have performed the initial editing activity. Moreover, 
we propose that the expectancy force model, modified to acknowledge that 
many individuals employ additive information processing, can be effec- 
tively employed to explain or predict the level of motivational force acting 
on an individual. As noted earlier, a large body of prior research indicates 
that additive linear models usually explain nearly all of the variance in the 
choices reached by most individuals even when their actual information 
processing is multiplicative (Harrell & Stahl, 1984; Stahl & Harrell, 1981; 
Slovic, Fishhoff, & Lichtenstein, 1977; Slovic & Lichtenstein, 1971). Accord- 
ingly, if an individual has performed the initial editing activity described by 
prospect theory (Kahneman & Tversky, 1979; Tversky & Kahneman, 1981), 
we would expect the additive form of the expectancy force model proposed 
in our earlier research (Stahl & Harrell, 1981: 321) will nearly always ex- 
plain adequately the individual’s choice behavior. We hope these proposals 
will stimulate other researchers to further examine these issues. 
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